Gene TM1040_3303 details

Gene Information

Locus tag: TM1040_3303
Symbol:
ID: 4075707
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Ruegeria sp. TM1040
Kingdom: Bacteria
Replicon accession: NC_008043
Strand:
Start bp: 310065
End bp: 311162
Gene Length: 1098 bp
Protein Length: 365 aa
Translation table: 11
GC content: 59%
IMG OID: 638004811
Product: ABC transporter related
Protein accession: YP_611537
Protein GI: 99078279
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG3839] ABC-type sugar transport systems, ATPase components
TIGRFAM ID:
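
The start and end coordinates agree with the listed gene and protein lengths if the coordinates are read as 1-based and inclusive, and if the final codon is a stop codon that is not counted in the protein. Both are assumptions here, not something the record states. A minimal Python sketch of that check, with hypothetical function names:

```python
def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp):
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length(310065, 311162)
print(length, protein_length(length))  # prints: 1098 365
```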


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.700812
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value: 0.735765
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGAATT TGAAACTGAC CGGGGTAGAG AAAACCTACG CTGGCGCGGT GAATGTCCTG 
AGAGACATCA ATCTCGACAT CAAGCAGGGG GAGTTGATTG TCTTTGTGGG TCCGTCGGGA
TGCGGCAAGT CCACGCTCCT GCGCATGATC GCGGGGCTGG AACGCATAAC CGGTGGGACA
TTGGAAATCG ACGGCGCGGT GATGAATGAC ATCCCGCCCG CCCAGCGGGG CATCGCCATG
GTGTTCCAGA GCTACGCGCT CTATCCACAT ATGACGGTGC GCGACAACAT GGGCTTTGCG
CTCAAGATCG CAGGTAAAAG CGCAAGCGAG ATTGAGGAGG CGATCACACG AGCAGCCAAG
ATCCTGCAGC TTGAGGACTA TCTCGACCGT CTGCCCAAGG CGCTTTCAGG TGGGCAGCGG
CAGCGGGTGG CAATCGGTCG GGCGATTGTG CGCGATCCAA AGGTCTATCT CTTTGATGAG
CCTCTCTCCA ATCTGGACGC GGCCCTGCGG GTGGCGACGC GGATCGAGAT TGCGCAGCTC
AAGGAAGCGA TGCCGGACAG CACCATGATC TATGTGACCC ACGACCAGGT GGAGGCGATG
ACGCTCGCTT CACGCATCGT TGTGCTCGCC AATAAGGGGA TCGCGCAGGT GGGCACGCCG
CTCGAACTCT ACGAGCGGCC CGAAAACGAA TTTGTCGCCC AGTTTATTGG TTCTCCGGCG
ATGAACCTGC TGCCGGGCGA GGTCATCGCA ACGGGCGATC TGACCCGCAT TCGCCTGGAG
AATGGTGAAG AGGTCGCCTC CACCGTGCCC ACCCGCACGA GCGACATGGG GCTCAAGGTC
AATGTGGGCG TGCGACCAGA GGATCTCTTT GAGGAGGGGG AGGGCGGCGC GATGATCGAC
GCTACGGTCG ACATCGTCGA AGCACTTGGT GAAGTGACGG TGCTCTATTT CAAGGCGCAA
GCCGGGCAAG ATGCGCCTGT TGCAAAATTG TCTGGTATTC ACAAAGGTTT GCGTGGAAGC
CAAGTGCGAC TCTACGCGGA TCCGAAGAAG GTACACCTCT TTCACAATGG GCATTCTCTT
CTGTATCGCG AGGGGTGA
 
Protein sequence
MANLKLTGVE KTYAGAVNVL RDINLDIKQG ELIVFVGPSG CGKSTLLRMI AGLERITGGT 
LEIDGAVMND IPPAQRGIAM VFQSYALYPH MTVRDNMGFA LKIAGKSASE IEEAITRAAK
ILQLEDYLDR LPKALSGGQR QRVAIGRAIV RDPKVYLFDE PLSNLDAALR VATRIEIAQL
KEAMPDSTMI YVTHDQVEAM TLASRIVVLA NKGIAQVGTP LELYERPENE FVAQFIGSPA
MNLLPGEVIA TGDLTRIRLE NGEEVASTVP TRTSDMGLKV NVGVRPEDLF EEGEGGAMID
ATVDIVEALG EVTVLYFKAQ AGQDAPVAKL SGIHKGLRGS QVRLYADPKK VHLFHNGHSL
LYREG
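
The protein sequence can be derived from the gene sequence by translating it codon by codon. The sketch below is a minimal illustration, not the database's own pipeline. It uses the codon assignments of NCBI translation table 11, which are the same as those of the standard table (the two differ only in which start codons are permitted). The helper names `clean`, `translate` and `gc_fraction` are hypothetical. Whitespace is stripped first, so the 10-base blocks shown above can be pasted in directly.

```python
BASES = "TCAG"
# Amino acids for all 64 codons, enumerated in TCAG order ("*" marks a stop codon).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq):
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq):
    """Translate a coding sequence in frame 0, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(seq):
    """Fraction of G and C bases in the sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First 30 bases of the gene sequence above.
print(translate("ATGGCGAATT TGAAACTGAC CGGGGTAGAG"))  # prints: MANLKLTGVE
```

Passing the full gene sequence to `translate` should give the protein sequence listed above. Passing it to `gc_fraction` gives a value to compare with the reported GC content.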