Gene TM1040_3014 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_3014 
Symbol 
ID4076587 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp3181937 
End bp3183130 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content58% 
IMG OID638008343 
Producthypothetical protein 
Protein accessionYP_615008 
Protein GI99082854 
COG category[R] General function prediction only 
COG ID[COG0820] Predicted Fe-S-cluster redox enzyme 
TIGRFAM ID[TIGR00048] radical SAM enzyme, Cfr family 


Plasmid Coverage information

Num covering plasmid clones30 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value0.327469 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACAGCCT CAGCGCCGAT CACCCAAGAT GTTTTGACCC TCCCCCGCAA AGAGCCGGAA 
GGCGGCAAGA TCAACCTTGT TGGTCTGACC CGTGACCGCA TGCGCGCGGT ATTGATCGAA
AACGGCACCC CGGAGAAACA GGCCAAGATG CGCGTCGGGC AGATCTGGCA GTGGATCTAC
CAATGGGGGG TACGAGACTT TGCGGAGATG ACAAATCTAG CCAAGGCCTA CCGCGCTCAG
CTGGAGGAAA CATTCGAGAT CCGCATCCCC GAGGTGGTCT CAAAACAAGT GTCGACCGAT
GGCACGCGCA AATATCTGGT GCGGATAAAT GGCGGCCATG AGGTTGAGGT GGTCTATATC
CCCGAGGACG ACCGGGGCAC CTTATGCATT TCCTCTCAGG TCGGCTGTAC GCTCACCTGT
TCGTTTTGCC ACACCGGCAC GCAAAAGCTG GTGCGCAACC TGACCCCGGC CGAAATCATC
GGACAGGTGA TGATGGCGCG GGATGACCTG GAAGAATGGC CCACCCCCGG CGCGCCAAAG
GATGAAACCC GCCTACTGTC CAACATCGTT CTGATGGGCA TGGGGGAGCC GCTTTATAAT
TTCGACAATG TGCGCGATGC GATGAAGATT GCGATGGACC CGGAGGGGAT TTCCCTCTCG
CGGCGTCGTA TCACGCTCTC GACCTCTGGC GTGGTGCCCG AGATTGCGCG GACGGCTGAG
GAAATCGGCT GTCTCCTTGC GATATCCTTT CATGCGACCA CCAATGAGGT GCGCGATGTG
CTGGTTCCGA TCAACCGTCG CTGGAACATC GATGAATTGC TGCAGGCGCT TGCAGATTAC
CCGAAGGTCT CGAACTCTGA GCGGATCACC TTCGAATATG TGATGCTTGA TGGGGTGAAC
GACTCTGATG AGGACGCACA TCGTCTTCTG GATCATATCA AGCGCCACAA CATTCCGGCC
AAGATCAACC TCATTCCCTT TAATGAGTGG CCGGGGGCGC CCTATAAACG GTCGTCCAAC
AACCGCATCC GGGCGTTTGC AAATATCATC TATCAGGCTG GCTATGCCTC GCCGATCCGC
AAGACCCGCG GCGATGATAT CATGGCCGCC TGCGGTCAGC TCAAGTCTGC CACGGAGCGG
GCCCGCAAGA GCCGCAAGCA AATCGAAGCC GAGGCCGGAG TGAACAACAG CTGA
 
Protein sequence
MTASAPITQD VLTLPRKEPE GGKINLVGLT RDRMRAVLIE NGTPEKQAKM RVGQIWQWIY 
QWGVRDFAEM TNLAKAYRAQ LEETFEIRIP EVVSKQVSTD GTRKYLVRIN GGHEVEVVYI
PEDDRGTLCI SSQVGCTLTC SFCHTGTQKL VRNLTPAEII GQVMMARDDL EEWPTPGAPK
DETRLLSNIV LMGMGEPLYN FDNVRDAMKI AMDPEGISLS RRRITLSTSG VVPEIARTAE
EIGCLLAISF HATTNEVRDV LVPINRRWNI DELLQALADY PKVSNSERIT FEYVMLDGVN
DSDEDAHRLL DHIKRHNIPA KINLIPFNEW PGAPYKRSSN NRIRAFANII YQAGYASPIR
KTRGDDIMAA CGQLKSATER ARKSRKQIEA EAGVNNS