Gene TM1040_0680 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_0680 
Symbol 
ID4077288 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp727602 
End bp728786 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content64% 
IMG OID638005977 
ProductHI0933-like protein 
Protein accessionYP_612675 
Protein GI99080521 
COG category[R] General function prediction only 
COG ID[COG2081] Predicted flavoproteins 
TIGRFAM ID[TIGR00275] flavoprotein, HI0933 family 


Plasmid Coverage information

Num covering plasmid clones28 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.192167 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCCCA TGCGGATAAA AACCTTGATT CTCGGCGCCG GCGCGGCTGG CATGATGTGT 
GCAGCCCATG CAGGGGGCGA TTGCCTCGTG GTGGACCACG CCAAGTCCCC CGGAGAGAAG
ATCCGCATCT CCGGCGGCGG GCGCTGCAAT TTTACCAATA TGTATGCCGC ACCCGAGAAT
TACATCTCGC AGAACCCGCA TTTCTGTAAA TCTGCGCTGG CCCGCTACAC GCAATGGGAT
TTCATTGACC TCGTGGGCCG TCATGGCATC GCGTGGCATG AAAAAACGCT TGGCCAGCTC
TTTTGCGATG ACTCCGCACG CCAGATCGTC GCGATGCTGG TCAAAGAATT GCGCGACGCC
GGGGCTGACC TGTGGTTGCA GACCTCGGTC GCGGATGTGG TGCATGGCCG TGACGGATAC
ACCGTCCGCC TCGAGCGCGA GGGCAAGCCC GTGACGATCA CGGCTCAGAA CCTCGTGCTG
GCAACCGGTG GCAAATCGAT CCCCAAGATG GGCGCGACGG GTCTTGCCTA TGACATCGCG
GGGCAGTTTG GGCTGCCCGT CCTTGAGACC CGCCCCGGGC TTGTTCCCCT CACCTTTGGC
GAGGGGCGTT TCAAACCTTT GGCCGGGGTC TCGGTGCCCG CACGGCTCTC CAATACTGCG
GCCAGTTTTG ACGAGGCGCT GCTCTTCACC CATCGGGGCC TCTCTGGACC GGCGGTTCTG
CAGATCTCGA GCTATTGGCG CGAAGGAGAG GACATCTTGG TCCACCTGCT GCCGGAACTG
GATCTTTTTT CGGCCCTGCG CGCGCAACGT CAGGAAAGCG GGCGCAAGGA TCTGACAACC
GAACTGGCGC GCCACCTGCC TGCACGGTTG GTGGAGGAGC TGGCGCAGGA CGGCAGCCTC
AGGGGGCGTT TGGCCGATCA GTCCGATGCA GCGCTCGAAG CCCTCTGCGC GCGGCTGCAC
AGTTGGCGAC TGAAGCCCAC CGGCACCGAG GGCTATCGCA CCGCCGAAGT GACGCTGGGC
GGGATCGACA CCGATGCACT GTCGTCGCGC TCGATGGAGG CCAAGGCGCA GCCCGGCCTC
TATGTAATCG GCGAAGCGGT GGACGTGACC GGCTGGCTCG GCGGCTATAA CTTCCAGTGG
GCCTGGGCGT CGGGCCACGC CGCAGGCACC GCCATTCGGG GCTGA
 
Protein sequence
MRPMRIKTLI LGAGAAGMMC AAHAGGDCLV VDHAKSPGEK IRISGGGRCN FTNMYAAPEN 
YISQNPHFCK SALARYTQWD FIDLVGRHGI AWHEKTLGQL FCDDSARQIV AMLVKELRDA
GADLWLQTSV ADVVHGRDGY TVRLEREGKP VTITAQNLVL ATGGKSIPKM GATGLAYDIA
GQFGLPVLET RPGLVPLTFG EGRFKPLAGV SVPARLSNTA ASFDEALLFT HRGLSGPAVL
QISSYWREGE DILVHLLPEL DLFSALRAQR QESGRKDLTT ELARHLPARL VEELAQDGSL
RGRLADQSDA ALEALCARLH SWRLKPTGTE GYRTAEVTLG GIDTDALSSR SMEAKAQPGL
YVIGEAVDVT GWLGGYNFQW AWASGHAAGT AIRG