Gene TM1040_2837 details


Gene Information

Locus tag: TM1040_2837
Symbol:
ID: 4076656
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008044
Strand:
Start bp: 3004905
End bp: 3006122
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 61%
IMG OID: 638008166
Product: hypothetical protein
Protein accession: YP_614831
Protein GI: 99082677
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG4421] Capsular polysaccharide biosynthesis protein
TIGRFAM ID:
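
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: the function name `check_lengths` is hypothetical, and it assumes 1-based inclusive coordinates and a gene length that includes the stop codon (the gene sequence in this record ends in TAA, which fits that assumption).

```python
def check_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the coordinate span, gene length, and protein length agree.

    Assumptions (not stated in the record itself):
    - start_bp and end_bp are 1-based and inclusive;
    - the gene length counts the terminal stop codon, which is not translated.
    """
    span = end_bp - start_bp + 1          # inclusive coordinate span in bp
    coding_codons = span // 3 - 1         # drop the stop codon
    return (
        span == gene_length_bp
        and span % 3 == 0                 # a CDS should be a whole number of codons
        and coding_codons == protein_length_aa
    )


# Values taken from the Gene Information fields of this record.
print(check_lengths(3004905, 3006122, 1218, 405))
```

For this record the span is 3006122 - 3004905 + 1 = 1218 bp, and 1218 / 3 - 1 = 405 codons, which matches the listed protein length.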


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.839793
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTGCGCAA CTGAACACCC GGCACCGACC ACGCCTCCCT CGCCGGAGGG CGGCTGGTCG 
GAAAGCATCT CGGTGCTGCG CAATGTGACG GTGGTGCCTC CGGTCGAGAG CAACCTCGTG
CAGGCCGCCG GTCTTTTGCG CGAGGACGGC AGCTATTGTG CTGAGGGTGC CTTGTGGCGC
AGGCATCGAC CCATCACGAC AGAACCTCCA AAACCCTCGG AGTTTGCGGA AAAAATCTCT
GGACGTTGGC TGTGGGGTGG CGTGCTATGG GCGCATTTCG GGCACTTTCT GGTCGAAAGC
ACCGCCCGTC TCTGGGCGCT GTCAGAACTG GATGCGCCGG TGGATGGGGT GTTGTTCATT
CCAAAACGCC CCGCCGTCAG AGATCAGGTG CGCGGGTTTC AGGCCGAATT CGTCGATCTC
ATGCAGAGGG ACCTGCCAAT CCGCGTTGCA GCGGATCCGT CTCTGGTTGA GGAACTTGTG
ATCCCCGGGC AGGGGTTTGG CCTTGGGAGG ATTACCGAGG CAACGCCCAA GTACCGCAAC
GCGATCCATG CCCGTTTTGC GCGCGACATC AAACCCGAGG GGCCGGAGAA GATCTACATC
TCACGCTCCA AGCTGGGGCT CGGCAAGGGC GGGCTGTTGG GCGAAGAGCA GATGGAAGCC
TTCCTCGCGG CGGAGGGCTA CGAGATTTTC CACCCACAGG AACATACCCT GTCGGAGCAG
CTGGCGCGCT ACAAGGCGGC GCGCAAGGTG ATCGCGGCTG ATGGTTCCGC GCTGCATCTT
TATGCAATGG TGGGGCGGCC CGATCAGAAG GTTGCGATGG TTCTGCGGCG CAAATCCACC
GCGCATACGC TGTTGACCGA CAACGTACGT TACTTCTGCA AGTGCGACCC CTTGGTGATT
GGTGCATTAC GCACGGAATG GGTGCCCAAG AACAATCAAC GCTCCAGCCG TCTGAGCTTT
GGGGAACTGG ATCATTCTGT TATCGGCCGG GCGCTCCACG AGGCAGGCTT TATTTCGGGT
GGGAAAAACT GGCCGGTGCT GGATGACGCC GCGCGCAATC AGGTGCTCAA AGACAAAGGC
ATTAAAAGTG ATCGCTTTGT CGAGTCCCCC GCGTTTCGCA AGGCGCGCGA GGAAAAGGAA
CGGGCGGAGC GTCGCGCCCG TCGCGCAGCA AGACACGCCC GCAGGCAGGC TCGCGCCGCT
GCGCAAAACG ACGGCTAA
 
Protein sequence
MCATEHPAPT TPPSPEGGWS ESISVLRNVT VVPPVESNLV QAAGLLREDG SYCAEGALWR 
RHRPITTEPP KPSEFAEKIS GRWLWGGVLW AHFGHFLVES TARLWALSEL DAPVDGVLFI
PKRPAVRDQV RGFQAEFVDL MQRDLPIRVA ADPSLVEELV IPGQGFGLGR ITEATPKYRN
AIHARFARDI KPEGPEKIYI SRSKLGLGKG GLLGEEQMEA FLAAEGYEIF HPQEHTLSEQ
LARYKAARKV IAADGSALHL YAMVGRPDQK VAMVLRRKST AHTLLTDNVR YFCKCDPLVI
GALRTEWVPK NNQRSSRLSF GELDHSVIGR ALHEAGFISG GKNWPVLDDA ARNQVLKDKG
IKSDRFVESP AFRKAREEKE RAERRARRAA RHARRQARAA AQNDG
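
The sequences above are laid out in space-separated blocks of ten characters, so they need whitespace removed before any length or composition check. The following is a minimal sketch of that cleanup with a simple GC fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical, and only the first line of the gene sequence is used as sample input.

```python
def clean_sequence(text):
    """Remove all whitespace (block spacing and line breaks) and upper-case the sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq):
    """Return the fraction of bases that are G or C; 0.0 for an empty sequence."""
    if not seq:
        return 0.0
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


# First line of the gene sequence from this record, used as sample input only.
first_line = "ATGTGCGCAA CTGAACACCC GGCACCGACC ACGCCTCCCT CGCCGGAGGG CGGCTGGTCG"
seq = clean_sequence(first_line)
print(len(seq))
print(round(gc_fraction(seq), 2))
```

Running the same helpers over the full gene sequence should give a length that can be compared with the Gene Length field and a GC fraction that can be compared with the GC content field.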