Gene TM1040_2134 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTM1040_2134 
Symbol 
ID4076448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameRuegeria sp. TM1040 
KingdomBacteria 
Replicon accessionNC_008044 
Strand
Start bp2239477 
End bp2241516 
Gene Length2040 bp 
Protein Length679 aa 
Translation table11 
GC content63% 
IMG OID638007454 
Productcapsule polysaccharide biosynthesis 
Protein accessionYP_614128 
Protein GI99081974 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3563] Capsule polysaccharide export protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGCAAG GGCCTTTGCC GCATCTGGAC GAGACCAGAA CCGGGAGCAA ACGCTCCCGG 
TTGTTTGTGT ATAACGGAGG GTTTCTGACC CAGTCCCGTG TCCGGCGCAT TCTTGATCTA
GCGGGCTATG ACATTCGTCT GGGCCTGCCC AAAGAGGGCG ATCTGGTCGG GATCTGGGGC
AACAGCCCCA CCGCCCATCG CGGCCGCAAG ATTGCCCAAG AGCATAACGC GCCCTTGGTG
CGCATTGAGG ATGCGTTCCT GCGCTCGGTG CATCCGGGCC GCGATGGCGA GCCCCCGATG
GGATTGCTGA TTGACCGCGC TGGCGTGCAT TTTGACCCGG CCCTTCCAAG CGACCTCATC
ACGCTCCTGA AAGAACACCC ACTCGATGAC AGCGCCTTGA TGAAGCGCGC GCGCGACGCC
ATGGCAAGGC TTCAGGACGC GCATCTGAGC AAATACAACG CCTTCCATCC CGATGTTCCC
CCACCAGAAC CCGGCTACGT GCTGGTGGTG GATCAATTGC GCGACGATGC GTCCGTCAAG
GCCTCGATCC CCTTCCCCGG CGCGGACCGG GGACGCTTTC AGGAAATGCT GGCCTTTGCG
CGCGAAGAAA ACCCCGGCGC GCGCGTATTG ATCAAGACCC ATCCGGAAAC CACCAAGGGC
CACCGTGCCG GCTATTTCGA CGCGCGTGAC ACCAATGATC AGGTGGAACT CTTTGATGCG
CCAGTCTCGC CACATCTGCT GCTCGAGGGC GCTGTGGGCG TCTATACCGT GTCGTCGCAA
CTCGGGTTCG AGGCCATCCT TTCCGGGCAC AGGCCGCGTA TCTTTGGCCA GCCTTTCTAT
GCCGGATGGG GCCTGACGCA GGATGAGTTC CCCCCCGCGG GGCGTCATCG TCGCCTGACC
CGCGCACAGC TTTTTGCGGC GGCCATGATC CTCTTTCCCA CCTGGTATGA TCCCCATCAC
GACCGGCTTT GCGAGCTGGA GGACGTGATC GACTCTCTGG AGGCGCAGGT CCGGGCCTGG
CGCGAAGACC GCGCTGGCTG GGCTGCCTCA GAGATGCGTC TGTGGAAGCG CGCGCCCTTG
CAGCAGTTCT TCGGGCGCCA CAAGCGCATG AGTTTCACCG AAAAAACCGC AACTGCCAAA
AAGAGCGGCA AGAACTGGAT GGTCTGGGCC AGCAAGGCGA CCAAGGATCA CGCCGGGGCG
CATCATCTGG AAGATGGATT CTTGCGCTCG CGGGGGCTCG GGGCTGAACT GGTGCCGCCC
TTGTCATTGG TGCTGGATCG TCAGGGCATT TATTATGACC CGACCCGTCA CAGTGATCTG
GATGACCTGA TCCGCGAACG CGTCACGCTG AGCCCGCAGC AGGAACGCCG GATCGAGGCG
CTGGTGATGC GGCTCATCAA ACATGAGGTG ACAAAATACA ATCTCGATGG CGCACTGCCC
GATTTGCCCA AAGGGCATCG GGTTCTGGTG CCCGGCCAGG TCGAGGATGA TGCCTCCCTG
CGGCTCGGAG CGGGTAAGAT CAACACCAAT ATGAAGCTGT TGCAGGCCGT GCGGGCTGCG
CGCCCGAATG CGGTCGTGAT TTACAAACCG CACCCGGATG TGGAGGCAGG CCTGCGCAAG
GGCCGCATCA GCCACGCCGA GGCCTGGGCG GATGTGGTGG CAGAGCATGC CAACCCGGCG
GCGCTGATCG ACAGTGTCGA TGAGGTCTGG ACCATGACGT CGCTCCTGGG GTTTGAGGCT
TTGTTGCGGC GGGTGCCGGT CACCTGTGTG GGGCTGCCGT TTTATGCTGG ATGGGGGCTG
ACGCGGGATC GGCTTCAGGC GCCCCATTGG CGCGATGCGC GCCCTGGTAT CCTTGGGCTC
GCCCATGCGG CGCTCATCGA CTATCCGCGC TATTTTGATC CGGTCTCAAA GCTGCCCTGC
GCGCCCGAGG TTGCTGTCGA TCGGCTGATT GCCGGAGAGC TGCCTGCGCG CAGCCCCCTG
AACAGCAGCC TCTCGAAATT GCAGGGGCTC TTTGCCTCAT TTGCTCCGCT CTGGCGCTGA
 
Protein sequence
MMQGPLPHLD ETRTGSKRSR LFVYNGGFLT QSRVRRILDL AGYDIRLGLP KEGDLVGIWG 
NSPTAHRGRK IAQEHNAPLV RIEDAFLRSV HPGRDGEPPM GLLIDRAGVH FDPALPSDLI
TLLKEHPLDD SALMKRARDA MARLQDAHLS KYNAFHPDVP PPEPGYVLVV DQLRDDASVK
ASIPFPGADR GRFQEMLAFA REENPGARVL IKTHPETTKG HRAGYFDARD TNDQVELFDA
PVSPHLLLEG AVGVYTVSSQ LGFEAILSGH RPRIFGQPFY AGWGLTQDEF PPAGRHRRLT
RAQLFAAAMI LFPTWYDPHH DRLCELEDVI DSLEAQVRAW REDRAGWAAS EMRLWKRAPL
QQFFGRHKRM SFTEKTATAK KSGKNWMVWA SKATKDHAGA HHLEDGFLRS RGLGAELVPP
LSLVLDRQGI YYDPTRHSDL DDLIRERVTL SPQQERRIEA LVMRLIKHEV TKYNLDGALP
DLPKGHRVLV PGQVEDDASL RLGAGKINTN MKLLQAVRAA RPNAVVIYKP HPDVEAGLRK
GRISHAEAWA DVVAEHANPA ALIDSVDEVW TMTSLLGFEA LLRRVPVTCV GLPFYAGWGL
TRDRLQAPHW RDARPGILGL AHAALIDYPR YFDPVSKLPC APEVAVDRLI AGELPARSPL
NSSLSKLQGL FASFAPLWR