Gene Saro_3970 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSaro_3970 
Symbol 
ID5077500 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNovosphingobium aromaticivorans DSM 12444 
KingdomBacteria 
Replicon accessionNC_009426 
Strand
Start bp138836 
End bp140002 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content62% 
IMG OID640481076 
Productphage integrase family protein 
Protein accessionYP_001165738 
Protein GI146275577 
COG category[L] Replication, recombination and repair 
COG ID[COG4974] Site-specific recombinase XerD 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGGCG GTCGCCGGTC GTGGACGGTA TTGGATGATG ACGGCGATGT CGTCGAGTGC 
CTTCGTCACT GGATCGTCCA TCTCGAACAG ACCCATGCGT CACCAAACAC GATCCGCGCT
TATGTTCGCC ACGTTGTGGA CTTCGCTAGC TTCCTCGGCG CAAACGGCGC CGGCATCCAT
GAAGCCACGG TTGCGCTGTA TGACAGCTTC CTTGCCTGGC GGCTTGCCCG CCGAAAGGAT
GCGCTGCCAA GTCCTCGGCT GATCCTGCTA CGCAAGCAGG AAACGCGGAT TCTGGCACCA
TCGACGCGCA ACCAGATCCA GCTCGCGGTC AAATCGTTCT ACCGCTTTTA CAACGGCACT
GACGACTTCG CGGTCGATAC GACCGAGGTC ACAAAGGCCT ATGACGGCCA CCGGATCTAC
AAGCCGTTCC TTGAGCATAT CAGCCAGCGA CGGACGACAC GGCGCAAAGA CCGTTATCTC
TCGGGCGATC CCGGCCGGGT CCAGCAGCAG GTGCTCAAGA AGCGGCTGAC GCCGAGCGAG
GTTCTGCGGC TGATCGAGGC CTGCGGGCTC GCGCGCGACG CCTTCCTGGT CGTGCTGCTC
TACAACACCG GCCTTAGGAT CGGTGAAGCG CTGGGCCTGC GCCATGTCGA TATCGATCTC
GCCGAAAAGG TCATCTGGGT CGTTCCGCGC GAAGACAATG CCAATGAGGC CCGTGCGAAA
TCGGGCCGGA CGCGCGGTGT GCCGGTGCAC GACTACGTGC TCAACATGTA CGTCGATTAC
ATCACCAGCG ACGAATATCT GCCAGCCTTC GAGTCCGGCG CCGAGTACGT CTTCACCAAT
GTCAAAGCCG GCGTCATCGG GCACGCCATG AGCCTGTCCT ACGCGCAGAA GCTCGCGGGC
CTGCTGGAGC AGCGCACCCA TATCGCCTTC AGCTGGCACA TGTTCCGCCA CAGCCATGCA
TCCGAGGCGA TCGCGGCGGG ATACAGCCTG CTCGAAGTGG CCGACCGGCT CGGGCATGCC
AGCCCGCAAA CGACAGCGGC GTTCTATCAG CACCTGTTCG CCTCGGAAAT CCGCAGGCTT
TACCTCACAG GACCCGACGA GGTGCATGAA AGGCTTGAGA AACTTCGCGA GGCTGAGCTG
CTCGGAAAGG ATCTGCGATG GGCCTGA
 
Protein sequence
MPGGRRSWTV LDDDGDVVEC LRHWIVHLEQ THASPNTIRA YVRHVVDFAS FLGANGAGIH 
EATVALYDSF LAWRLARRKD ALPSPRLILL RKQETRILAP STRNQIQLAV KSFYRFYNGT
DDFAVDTTEV TKAYDGHRIY KPFLEHISQR RTTRRKDRYL SGDPGRVQQQ VLKKRLTPSE
VLRLIEACGL ARDAFLVVLL YNTGLRIGEA LGLRHVDIDL AEKVIWVVPR EDNANEARAK
SGRTRGVPVH DYVLNMYVDY ITSDEYLPAF ESGAEYVFTN VKAGVIGHAM SLSYAQKLAG
LLEQRTHIAF SWHMFRHSHA SEAIAAGYSL LEVADRLGHA SPQTTAAFYQ HLFASEIRRL
YLTGPDEVHE RLEKLREAEL LGKDLRWA