Gene TM1040_0103 details

Gene Information

Locus tag: TM1040_0103
Symbol:
ID: 4078688
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 109602
End bp: 110837
Gene Length: 1236 bp
Protein Length: 411 aa
Translation table: 11
GC content: 61%
IMG OID: 638005390
Product: hypothetical protein
Protein accession: YP_612098
Protein GI: 99079944
COG category: [S] Function unknown
COG ID: [COG1322] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
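
The start, end, gene length and protein length fields above agree with each other, which can be checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the protein length excludes the terminal stop codon; the function names are illustrative and not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a complete CDS, not counting the stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length = gene_length(109602, 110837)
print(length, protein_length(length))  # prints "1236 411"
```

Both values match the Gene Length and Protein Length fields of this record.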


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGACGGCA TGGAGATTGC CATCGCCAAT ATGAGCGCCG CAGGACCCGT CCTTTGGGCC 
GCCGGCGCAG GCATGCTTTT GGTGCTTGTG TTGCTGTTTC AATCGTGGCG CACGTCGGCG
CGCACCGCCC GCGCACTGGA GCCCCTGAGC CAGCAGATGC ACACGCTCGG GCATGTGGCG
CAGCAGCTCT CTGCCGGGCA GGACGCCTTG CGCGGCAACT TGCAGACAGT GTCGGACACT
CAGGCGCATG CGCAGATGCA GATCCTTCAG ACCATGGAAG CGCGCCTTGG AGATGTGCAA
CAGCGGATGA ACGACCGGCT GGCAGAAAAC GCGATGAAAC AGGCGCGCGC AATGTCCGAG
ATGCAGGAGC GCATGGCCGA GAGCCTGCAC GGGAATGCCA AACGCACTGC TACCTCGCTC
ACCCAGTTGC AAGAACGGCT TGCGGTGATC GACAAGGCGC AGGACAATAT CACGAAGCTC
TCGGGCGATG TGCTTTCGCT GCAGGATATC CTGTCAAACA AGCAGACGCG GGGCGCCTTT
GGTGAGATCC AGTTGAATGA CATTGTCTCA AAGGCGCTGC CGAGCGATTC CTATGCATTC
CAACACACGC TCTCCAATGG CAAACGCGCG GACTGCCTGA TCCACTTGCC CAACCCGCCC
GGGCCCATCG TGATCGACAG CAAGTTCCCG CTCGAGCCCT ATGAAGCGCT GCGCGGCGCT
GAAACCCAGG AGGCGCGCGC CCAAGCGGCC CGGCTCCTTA AGGGCGCGCT GCGCAAACAT
ATCCGAGACA TCGCAGAGAA ATATATCCTC GAAGGGGAAA CCGCCGACGG GGCGCTGATG
TTTCTGCCTT CGGAGGCGGT CTATGCAGAG CTACACGCGA ATTTCTCGGA TGTGGTGCGC
GAAGGGTTCT CGCTCAAGGT CTGGATCGTC TCGCCCACCA CATGCATGGC GACGCTGAAC
ACGATGCGGG CGATCCTGAA AGATGCCCGC ATGCGCGAAC AGGCGGGCGC CATTCGCCAG
GAACTGGGTC TGCTGCACAA GGATGTTGAA CGTCTCGGCG ACCGGGTGGG CAATCTCGAT
CGGCATTTCG CGCAAGCCCA ACGGGATATT TCCGATATCA AGATCAGCGC CGACAAGGCT
GGGCGACGCG CCCAGCGGCT AGATAATTTT GACTTTGAGG ACCTTAACCC AGAGAGCGTA
TCGCGGGTTG TTGCCCTGGA GCACCCGGGC GAATGA
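
The GC content field can in principle be recomputed from the gene sequence above. The sketch below shows one way to do that, assuming the sequence is pasted as text in the 10-base groups used here; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first line of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the gene sequence, as formatted in this record.
first_line = "ATGGACGGCA TGGAGATTGC CATCGCCAAT ATGAGCGCCG CAGGACCCGT CCTTTGGGCC"
print(len(clean_sequence(first_line)))
print(f"{gc_fraction(first_line):.1%}")
```

Passing the full 1236 bp sequence to `gc_fraction` should give a value close to the 61% reported in the record, assuming the database computes GC content over the same span.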
 
Protein sequence
MDGMEIAIAN MSAAGPVLWA AGAGMLLVLV LLFQSWRTSA RTARALEPLS QQMHTLGHVA 
QQLSAGQDAL RGNLQTVSDT QAHAQMQILQ TMEARLGDVQ QRMNDRLAEN AMKQARAMSE
MQERMAESLH GNAKRTATSL TQLQERLAVI DKAQDNITKL SGDVLSLQDI LSNKQTRGAF
GEIQLNDIVS KALPSDSYAF QHTLSNGKRA DCLIHLPNPP GPIVIDSKFP LEPYEALRGA
ETQEARAQAA RLLKGALRKH IRDIAEKYIL EGETADGALM FLPSEAVYAE LHANFSDVVR
EGFSLKVWIV SPTTCMATLN TMRAILKDAR MREQAGAIRQ ELGLLHKDVE RLGDRVGNLD
RHFAQAQRDI SDIKISADKA GRRAQRLDNF DFEDLNPESV SRVVALEHPG E
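
The protein sequence follows from the gene sequence by codon-by-codon translation. The sketch below is an illustration rather than the database's own method: it builds the codon table by hand, relying on the fact that translation table 11 assigns the same amino acids to sense codons as the standard code (the tables differ in which codons may act as starts). Alternative start codons are not handled, which is acceptable here because this gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for codons enumerated in TCAG order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]  # KeyError on ambiguous bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence in this record.
print(translate("ATGGACGGCA TGGAGATTGC CATCGCCAAT"))  # prints "MDGMEIAIAN"
```

The result matches the first ten residues of the protein sequence listed above.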