Gene Saro_1934 details

Gene Information

Locus tag: Saro_1934
Symbol:
ID: 3917157
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 2048323
End bp: 2049489
Gene Length: 1167 bp
Protein Length: 388 aa
Translation table: 11
GC content: 62%
IMG OID: 640444680
Product: phage integrase
Protein accession: YP_497208
Protein GI: 87199951
COG category: [L] Replication, recombination and repair
COG ID: [COG4974] Site-specific recombinase XerD
TIGRFAM ID:
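
The Start bp, End bp, Gene Length, and Protein Length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (an assumption that is consistent with the listed length) and that the final codon is a stop codon that is not counted in the protein length. The variable names are illustrative only.

```python
# Coordinates as listed in the record (assumed 1-based, inclusive).
start_bp = 2048323
end_bp = 2049489

# Inclusive span: both end positions are part of the gene.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of 3 bases; the last codon is assumed to be
# a stop codon, which does not add a residue to the protein.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1167 388
```

Both values agree with the Gene Length (1167 bp) and Protein Length (388 aa) fields.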


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCCCGGCG GTCGCCGGTC GTGGACGGTA TTGGATGATG ACGGCGATGT CGTCGAGTGC 
CTTCGTCACT GGATCGTCCA TCTCGAACAG ACCCATGCGT CACCAAACAC GATCCGCGCT
TATGTTCGCC ACGTTGTGGA CTTCGCTAGC TTCCTCGGCG CAAACGGCGC CGGCATCCAT
GAAGCCACGG TTGCGCTGTA TGACAGCTTC CTTGCCTGGC GGCTTGCCCG CCGAAAGGAT
GCGCTGCCAA GTCCTCGGCT GATCCTGCTA CGCAAGCAGG AAACGCGGAT TCTGGCACCA
TCGACGCGCA ACCAGATCCA GCTCGCGGTC AAATCGTTCT ACCGCTTTTA CAACGGCACT
GACGACTTCG CGGTCGATAC GACCGAGGTC ACAAAGGCCT ATGACGGCCA CCGGATCTAC
AAGCCGTTCC TTGAGCATAT CAGCCAGCGA CGGACGACAC GGCGCAAAGA CCGTTATCTC
TCGGGCGATC CCGGCCGGGT CCAGCAGCAG GTGCTCAAGA AGCGGCTGAC GCCGAGCGAG
GTTCTGCGGC TGATCGAGGC CTGCGGGCTC GCGCGCGACG CCTTCCTGGT CGTGCTGCTC
TACAACACCG GCCTTAGGAT CGGTGAAGCG CTGGGCCTGC GCCATGTCGA TATCGATCTC
GCCGAAAAGG TCATCTGGGT CGTTCCGCGC GAAGACAATG CCAATGAGGC CCGTGCGAAA
TCGGGCCGGA CGCGCGGTGT GCCGGTGCAC GACTACGTGC TCAACATGTA CGTCGATTAC
ATCACCAGCG ACGAATATCT GCCAGCCTTC GAGTCCGGCG CCGAGTACGT CTTCACCAAT
GTCAAAGCCG GCGTCATCGG GCACGCCATG AGCCTGTCCT ACGCGCAGAA GCTCGCGGGC
CTGCTGGAGC AGCGCACCCA TATCGCCTTC AGCTGGCACA TGTTCCGCCA CAGCCATGCA
TCCGAGGCGA TCGCGGCGGG ATACAGCCTG CTCGAAGTGG CCGACCGGCT CGGGCATGCC
AGCCCGCAAA CGACAGCGGC GTTCTATCAG CACCTGTTCG CCTCGGAAAT CCGCAGGCTT
TACCTCACAG GACCCGACGA GGTGCATGAA AGGCTTGAGA AACTTCGCGA GGCTGAGCTG
CTCGGAAAGG ATCTGCGATG GGCCTGA
 
Protein sequence
MPGGRRSWTV LDDDGDVVEC LRHWIVHLEQ THASPNTIRA YVRHVVDFAS FLGANGAGIH 
EATVALYDSF LAWRLARRKD ALPSPRLILL RKQETRILAP STRNQIQLAV KSFYRFYNGT
DDFAVDTTEV TKAYDGHRIY KPFLEHISQR RTTRRKDRYL SGDPGRVQQQ VLKKRLTPSE
VLRLIEACGL ARDAFLVVLL YNTGLRIGEA LGLRHVDIDL AEKVIWVVPR EDNANEARAK
SGRTRGVPVH DYVLNMYVDY ITSDEYLPAF ESGAEYVFTN VKAGVIGHAM SLSYAQKLAG
LLEQRTHIAF SWHMFRHSHA SEAIAAGYSL LEVADRLGHA SPQTTAAFYQ HLFASEIRRL
YLTGPDEVHE RLEKLREAEL LGKDLRWA
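
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a block of the DNA. The sketch below is illustrative rather than the method used to produce this record. It builds the codon table by hand instead of using a bioinformatics library, and the helper name `translate` is hypothetical. Translation table 11 assigns amino acids to codons in the same way as the standard genetic code and differs only in which codons may act as start codons; this gene begins with ATG, so that difference does not come into play here.

```python
# Codon-to-amino-acid assignments in TCAG order ('*' marks a stop codon).
# These assignments are shared by the standard code and translation table 11.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO))


def translate(dna):
    """Translate DNA in frame 0, ignoring whitespace, up to the first stop."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First and last lines of the gene sequence as listed above.
first_block = "ATGCCCGGCG GTCGCCGGTC GTGGACGGTA TTGGATGATG ACGGCGATGT CGTCGAGTGC"
last_block = "CTCGGAAAGG ATCTGCGATG GGCCTGA"

print(translate(first_block))  # MPGGRRSWTVLDDDGDVVEC
print(translate(last_block))   # LGKDLRWA
```

Each full line of the gene sequence holds 60 bases (20 codons), so every line starts in frame and can be translated on its own. The two results match the first 20 and the last 8 residues of the protein sequence.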