Gene Daro_3345 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagDaro_3345 
Symbol 
ID3568300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameDechloromonas aromatica RCB 
KingdomBacteria 
Replicon accessionNC_007298 
Strand
Start bp3601586 
End bp3602839 
Gene Length1254 bp 
Protein Length417 aa 
Translation table11 
GC content63% 
IMG OID637681817 
Productmajor facilitator transporter 
Protein accessionYP_286544 
Protein GI71908957 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2271] Sugar phosphate permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones56 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.231985 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCCGCC GCTGGCTGGC CCTTGGCATC GTGGCGCTGG CCTATATCCT GTCCTTCTTC 
CAGCGTTTCG CTCCGGCCGG GATCGCCCAG GATCTGGCGG CCTCTTTCGA GACGTCGGCG
GCTTCGCTCG GCATCCTGGC TGCGACCTAT TTTTACGTTT ATACGCTGAT GCAGGTGCCG
ACCGGCATTC TCGTCGATAC GCTCGGGCCG CGGCGTATCC TGGCTCTTGG CGGATTGATC
GGCGGTGCCG GCAGCTTCCT TTTTGGATTC GCGCCAAGCC TTGAGCTGGC TCTGGTTGGT
CGCACACTGA TCGGGTTTGG TGTGTCGGTC ACCTTCATCG CCATGCTCAA GCTGGTTGCC
GTGTGGTTCG AGGAAAACCG CTTTGCCACC ATGGTCGGTA TCTGCATGCT GATCGGCAAT
CTTGGTTCGG TGCTGGCCGG TGCGCCCCTC TCGGCGCTGG CCCAGGCGAC CGGCTGGCGT
GGCGTGTTCA TCGGCGTCGG CTTTGCTTCG CTGGTTTTGG GCGCCTTGTG CTGGCTGATC
GTCCGCGATA CGCCGGAATC CGGGGTGGCC GTCCCCAAGC CACACTTCGA CCGGACGGCG
GTGTTGTCCA ACCTGTGGGC CGTGGTCAAA AACAGGGATA CCTGGCCGGC GGTGGCGGTC
AATACCGGCA TGTCCGGCGC CTTTTTTACC TTCGCCGGCT TGTGGGCGAT GCCTTATTTG
ATGCAAGTGC ATGGCCTGGC GCGTGCCGTC GCGGCGACAC ACCTGTCCCT GTGGTTTGGC
GGCTTTGCCA TTGGTTGCCT GTTCATCGGC GGTTTGTCCG ACCGTCTGGG CCGGCGCAAA
CCGGTGCTGA TCGTCGCATC GCACCTTTAT GGCGCAATCT GGCTGATCTG GCTGTCCTGC
ACGACGATGC CGCTGGCGTT GTCGTATGCA CTTTTTGCCC TGATGGGGCT GACGACGGCG
GGATTCAGTC TGACCTGGGC CTGTTCCAAG GAAGTGAATC CGCCCATGCT GTCCGGCATG
TCGACCAGCG TCGCCAACAT GGGGGGCTTC CTGGCCGGTG CGCTGCTGCA ACCGCTGGTT
GGCTGGATCA TGGATCTCGG CTGGAAGGGT GAGATGGTCA GTGGCGCCCG TGTTTACGAT
GTCGAAATCT GGCGCCATGG CGTCCTGGTG CTCACCGTCT GCGCCATTCT GGGCGCTGCA
TCCTGCTGGT GGATCAGGGA AACGCGCTGC CGGAATATCT GGCAGGCTGG GTAA
 
Protein sequence
MRRRWLALGI VALAYILSFF QRFAPAGIAQ DLAASFETSA ASLGILAATY FYVYTLMQVP 
TGILVDTLGP RRILALGGLI GGAGSFLFGF APSLELALVG RTLIGFGVSV TFIAMLKLVA
VWFEENRFAT MVGICMLIGN LGSVLAGAPL SALAQATGWR GVFIGVGFAS LVLGALCWLI
VRDTPESGVA VPKPHFDRTA VLSNLWAVVK NRDTWPAVAV NTGMSGAFFT FAGLWAMPYL
MQVHGLARAV AATHLSLWFG GFAIGCLFIG GLSDRLGRRK PVLIVASHLY GAIWLIWLSC
TTMPLALSYA LFALMGLTTA GFSLTWACSK EVNPPMLSGM STSVANMGGF LAGALLQPLV
GWIMDLGWKG EMVSGARVYD VEIWRHGVLV LTVCAILGAA SCWWIRETRC RNIWQAG