Gene TM1040_2095 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2095 
Symbol 
ID4077846 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2199768 
End bp2202122 
Gene Length2355 bp 
Protein Length784 aa 
Translation table11 
GC content63% 
IMG OID638007414 
Producthypothetical protein 
Protein accessionYP_614089 
Protein GI99081935 
COG category[N] Cell motility 
COG ID[COG1360] Flagellar motor protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCCTCA AGCGGCGCAC TGGTCAACGG TTTCAAGGCT CGATCTGGCC CGGTTTCGTG 
GATGCGATGA CCGGACTTTT GCTGGTTTTG ATGTTTGTTC TCACCATTTT TATGGTGGTG
CAATTTGTGC TGCGCGAGAC GATCTCGGGC CAGGAATCCG AGCTTGATGA ACTCTCGACC
GAGGTGCGGG CGCTGGCAGA GGCACTTGGC GTCAAGGAAC GCGAGGCCAG CCAGTTGCAG
GCCCGACTGG GCGCGCTGGG GGCGACGCTG TCCTCAACGC GCTCTGATCT TGATGCCGCG
CGAGAGCAAA TTTCGAACCA GACCGCGCGG ATCAGCGCCC TGACGCAGGA ACGTGATGCC
GCGAGATCGG ATCTTGCAAC GGCCCGCACG CAGATTTCCG ACTTTGAGGC GCAGGTGGCG
GCTCTGATCG CTGGTCGTGA AAGTGCCGAG GCGCAGATTG CCGATCTCAC CGCCGAGCGG
GACGCGCTTG ATGCGGCACG CAGCGAGTTG CTCTCAGAAC AGGAGGCGCT GAACCTTGCA
TTGGCCCAAC TGCGCGAAGA GGTGGACGCG GAGGCCGAAG CCGCCCGTCT TGCCGCCGCT
CAGACCGAGG CCCTGCAGGC GCTGGTGGAG GATCTGCGCG CAGAAGGGGC GGCACAGTCT
GAACGCGTGA GTGCGCTCGA AGAGGCGCTC TCTGAAGAAG AGGCGACCCG GCTGGCTGAG
GCCGCTGCGG CAGAGGCCCT GCGCGCGCGG TTGGAAAACG CGGATGCCGA ACTCACTGCA
ATGACCCTTG CACTGGAAGA AGAGCGCAAG AAGGCCGAGG ACACGCTGAC GCTTCTGGCG
GCTGCGGAGG CGGCGCGCGA CCAGCTTGAT ACGGAGCTTG AAGAGGCCTT GGCCGCGATT
GAGCAAGCTA AGGCGCAGGT CAATGATCGC GACGAGTTGG CAGAGCGCCT GACCCGCGTT
CTGGCCCAGA TGGAAGTCAC TGAAAGCACG GCAACGGCGC GAGTCTCGGC GCTTGAAGCG
GAGCTGGAAC GCGTCAGAAA CGAGAATGCT GCAACCCGCG AGCGTATGAC TGCAGACTTG
GAGACGGCGC GCCAGGAGGC CGCAGATACG CGCAGCCGCC TTGAGGCGGA GCTGACCCGA
CAGCGGGCGC AGACGGTGGA AACCGAGAGC CAATATCAGG CCCAACTGCG CGCCGCCCAA
GAGAGTTTTG ACGCCGAGCG CCGCGCGTTG GAAGATCGCC TTGCGACCCT CGAGGCACAG
GCCGATACCA CGCGGCGTGA CCTGGAAGAC CAGCTTGCCG CGCTGCGTGG TCAGGCAGAA
GAAACCCGCA GTGGGCTTGA AACGCGTTTG GCGCGTGCCG AGGCCGATCT GGCTGCAGCC
CGCGCGGCTG CGAGTTCCAC CGCAGAGGAA CGCGCTTCGG TTGAACAGCG TTTGCTCATC
GCGCTTGAGG CGCTGGAACG GGCGCAGGCT GCGGCAAGCG ATCAGGAGGT TCTGCAGAGC
CGTCTTCTGG CCGCCTTGGC GCAAAAAGAT GATTTTGCAC AGGAGATTTC CGAACAGCGC
ACCCTCGCAG AACAGCGCGC GGATCTCTTG GCGCAGGCCC GCGCTGCACT TGCGGAGGAA
AAACAGATCT CGGAAGACGC CCGCCGCGAG ACGGCCTTGC TCAATCAGCA GGTGGCGGCG
CTGCGTGAAC AGCTTGGTGG GCTGCAGTCG CTCCTGGACG ATTTCAAAGA ACGGGACGCA
GCACAGGGAA TTCAGCTCCA GAGCCTGGGT CAGGATCTCA ATACAGCGCT TGCGCGTGCC
GCCGCTGAAG AGCGCCGCCG CCGCATGCTC GAAGAACAGG AGCGCAAACG TCTTGAGGCA
GAGCGCGAAC GTCTCGCCAA TGAAGCCAAG GATCTGGAGC AATATCGCTC CGAGTTCTTT
GGTCAGTTGC GCAGCGTCCT GGGCAATCAG GAAGGTGTAC GCATCGAAGG CGACCGTTTT
GTCTTTGCCT CCGAGGTGCT GTTTGCACTG GGAAGCGCCG AGCTCTCAGA GGCCGGCAAG
GCCGAAATCG CCAAGGTGGC GCGCATCCTG CAAAACGTCG CCGCCGCCAT CCCGGATGAC
ATCAACTGGA TCATCCGTGT GGATGGGCAC ACGGACAACC AGCGCTTTGT TGGGGCGGGC
AAATACGCCG ACAACTGGGA GCTGAGCCAG GGCAGGGCGC TTTCGGTTGT GCGCTACATG
ATTGATGAGC TGGGCATCCC CCCGGGACGT CTTGCGGCCA ACGGATTTGG CGAGTTCCAG
CCGGTCAATC CAGCTGACAC GCCTGAGGCG CGCGCGCAGA ACCGCCGAAT CGAATTGAAG
CTTACGGAAC GCTGA
 
Protein sequence
MALKRRTGQR FQGSIWPGFV DAMTGLLLVL MFVLTIFMVV QFVLRETISG QESELDELST 
EVRALAEALG VKEREASQLQ ARLGALGATL SSTRSDLDAA REQISNQTAR ISALTQERDA
ARSDLATART QISDFEAQVA ALIAGRESAE AQIADLTAER DALDAARSEL LSEQEALNLA
LAQLREEVDA EAEAARLAAA QTEALQALVE DLRAEGAAQS ERVSALEEAL SEEEATRLAE
AAAAEALRAR LENADAELTA MTLALEEERK KAEDTLTLLA AAEAARDQLD TELEEALAAI
EQAKAQVNDR DELAERLTRV LAQMEVTEST ATARVSALEA ELERVRNENA ATRERMTADL
ETARQEAADT RSRLEAELTR QRAQTVETES QYQAQLRAAQ ESFDAERRAL EDRLATLEAQ
ADTTRRDLED QLAALRGQAE ETRSGLETRL ARAEADLAAA RAAASSTAEE RASVEQRLLI
ALEALERAQA AASDQEVLQS RLLAALAQKD DFAQEISEQR TLAEQRADLL AQARAALAEE
KQISEDARRE TALLNQQVAA LREQLGGLQS LLDDFKERDA AQGIQLQSLG QDLNTALARA
AAEERRRRML EEQERKRLEA ERERLANEAK DLEQYRSEFF GQLRSVLGNQ EGVRIEGDRF
VFASEVLFAL GSAELSEAGK AEIAKVARIL QNVAAAIPDD INWIIRVDGH TDNQRFVGAG
KYADNWELSQ GRALSVVRYM IDELGIPPGR LAANGFGEFQ PVNPADTPEA RAQNRRIELK
LTER