Gene TM1040_2195 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2195 
Symbol 
ID4078186 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2304591 
End bp2306279 
Gene Length1689 bp 
Protein Length562 aa 
Translation table11 
GC content63% 
IMG OID638007517 
Productband 7 protein 
Protein accessionYP_614189 
Protein GI99082035 
COG category[S] Function unknown 
COG ID[COG2268] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.807864 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGGAT CTTTCCTATT GGTGCCTGTC ATCAGTATCT TGGCGCTGGT TGCCCTCATC 
GGGCTTGTCC TTGGACGGCT CTATCGCCGG GCCACCCGTG AAGTCAGCCT GGTAAAGACC
GGCTCTGGCG GCAAAAAGGT CATTATGGAC GGCGGTACGG TTGTGGTTCC GCTGCTGCAT
GAAATCAGCC CGGTCAACAT GAAGACCCTG CGTCTGGAGG TGCAGCGCTC GGGTGAGGCG
GCACTCATTA CCCAGGACCG CATGCGGGTC GATGTGGGTG TGGAGTTCTA CGTCTCGGTG
ATGGCCACCG AAGAAGGGAT TTCGCGCGCG GCGCAGACGC TTGGGGACCG CACATTCGAT
GTCGAGCAGC TGCGCGAGAT GATCGAAGGC AAGCTCATCG ATGGTCTGCG CGCCGTGGCG
GCCCAGATGA CGATGGACGG GCTCCATGAA AACCGCGCCG ATTTTGTGCA GGAGGTGCAG
AATGCCGTCT CCGAGGATCT GCTGAAAAAC GGTCTGTCGC TTGAATCCGT CTCGCTGACC
GCGCTCGACC AGACACCCTT TGAGGCGCTG GATGAAAACA ACGCCTTCAA CGCGGTCGGT
ATGCGCAAGC TGGCAGAGGT GATTGCGACC TCCAAAAAAG AGCGCGCGCA GATCGACGCA
GAGGCAGAAG TCGCCGTGCG CCGCGCCGCA ATGGAAGCCG AGCGTCACAA GCTGTTGATC
GAGCAGGATG AACAGCAGGC CCGCATCGAG CAGATGCAAA AGGTCGAGAC CATGCGAGTC
GCCCAAGAGG CGGAGATCGC AGCCCGGACC GAGGACTCGG TGCGCGAAAC AGAACGCGCG
CGGATCGCCC GCGAAGAAGC CATCCGCGCC GCCGATATTG AGCGCGAGCG CAAGATCCGC
GAGGCCGAGA TCACCAAGGA GCGCGAACTG GAGGTGGCCG AGCAGGAACG CCAGATCATC
ATTGCGCAGA AATCCGAGGA AGAAAGCCGC GCCCGCGCCT CTGCCGACCT CGCCCGTGCC
GAGGCCATCA AGGCGACCGA GGCCGTCGCG ACCGCGCGTG AGGTGGCCGA GGCCGAGCGT
CAAAAGCAGA TTGTCCTCAT TGAGGCTGCG CGAGAGGCAG AGCGTCAAGC CACTGGCATC
CGTCTGGCCG CGCAGGCCGA AAAAGAAGCC GCCGCCGACC GCGCCGAGGC CCGTCGCGAG
GAAGCACAGG CCGAGGCAGA CGCGCTCAAT ATTCGCGCGG AGGCCAAGAA AAACGACATG
CTGGCCGAAG CGGAAGGTAA ACGCGCCCTT GTGGAGGCGG ACAATGCGCT CTCGCCGGAA
CTGGTGCGCA TGAAGGTTGA CCTCGCTCGC ATCGAGGCGA TGCCCTCGAT CATTGCAGAG
ATGGTGAAAC CGGCCGAGAA AATCGACTCG ATCAAGATCC ATCAGGTCGG TGGCGTGGGC
GGCGGCGCGG CCTACAGCAG CGCGGGCGCC TCTGGCGACA AACCCGTGGT CAATCAGGCG
CTCGATTCCA TCATGGGCAT GGCGGTGCAG ATGCCGGCGC TCAAAACACT GGGGCGTGAA
CTGGGGATCT CAATGGAGGA CGGCGTGTCC GGCGTGGTGA ACGGCATGCT GGAGGGCAAT
GACATCGCCC CCGAAGTCGC CGCCGACCCG GAGGCAACGG ATCAGGCGAA GACCTCAGAG
GTTCACTAA
 
Protein sequence
MDGSFLLVPV ISILALVALI GLVLGRLYRR ATREVSLVKT GSGGKKVIMD GGTVVVPLLH 
EISPVNMKTL RLEVQRSGEA ALITQDRMRV DVGVEFYVSV MATEEGISRA AQTLGDRTFD
VEQLREMIEG KLIDGLRAVA AQMTMDGLHE NRADFVQEVQ NAVSEDLLKN GLSLESVSLT
ALDQTPFEAL DENNAFNAVG MRKLAEVIAT SKKERAQIDA EAEVAVRRAA MEAERHKLLI
EQDEQQARIE QMQKVETMRV AQEAEIAART EDSVRETERA RIAREEAIRA ADIERERKIR
EAEITKEREL EVAEQERQII IAQKSEEESR ARASADLARA EAIKATEAVA TAREVAEAER
QKQIVLIEAA REAERQATGI RLAAQAEKEA AADRAEARRE EAQAEADALN IRAEAKKNDM
LAEAEGKRAL VEADNALSPE LVRMKVDLAR IEAMPSIIAE MVKPAEKIDS IKIHQVGGVG
GGAAYSSAGA SGDKPVVNQA LDSIMGMAVQ MPALKTLGRE LGISMEDGVS GVVNGMLEGN
DIAPEVAADP EATDQAKTSE VH