Gene GWCH70_2765 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2765 
Symbol 
ID7977988 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2801942 
End bp2803135 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content48% 
IMG OID644799561 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_002950720 
Protein GI239828096 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000543377 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCCGTG CGCTTTGGAT ATTAATGATC GGCATGGCCA TCAACGTGAC AGGGGCTTCT 
TTTTTATGGC CGATGAATAC GATTTACGTT CATGAGCAGC TTGGCAAATC ACTCTCTGTC
GCTGGAATGG TGCTCATGCT CAATTCTGGA GCAAGCGTGG TTGGAAACTT AGCCGGAGGG
CTATTGTTTG ATAAAATTGG CGGTTTTAAG TCGATGATGA TCGGAATTGT CGCGACGATG
TCCGCGCTTG TTGGCCTTGT TTTCTTTCAT GGCTGGCCGC ATTATGCCAT CTTTTTAACG
ATCATCGGGA TGGGAAGCGG GATTGTATTT CCAGTAGCGG CTGCGTATGC GGGAGCAATT
TGGCCAGAGG GCGGAAGACG GGCATTTAAC GGGCTTTATG TCGCGCAAAA TATTGGCGTG
GCAGTTGGTT CGGCGCTTGG CGGTGTAGTC GCATCGTATT CGTTTACGCT TATTTTCCTT
GCCAATTTAT TGCTATACGC CGTCTTTTGC TTGCTTATCG TGTTTGGATT GCGTCATGTC
CGCGCTGTCA AAGCGGCAAA ATCAAAGGAG GCGGAAGCAG TGAAAGGAGA GGCGCGGGCC
AATTGGTATG CGCTCGTGAC GCTTTGTACC GGTTATTTTT TATGTTGGAT TAGCTATGTG
CAATGGGCAA CGACAATTGC TGCATATACG CAAGAGCTCC ACATTACGCT GAAACAATAT
AGCCTTCTTT GGACGATTAA TGGTGCGCTT ATTGTATTCG CGCAACCTTT CTTGTCAGCA
GTTGTGCAAC GATGGATGCG GCATGTAAAA CGGCAAATGC TTGTTGGATT TGCTATTTTT
GTCGTTTCTT TTTCTCTTTT GCTGCAAGCG AATTCGTTTT TTGAATTTGC GCTGGCAATG
GTTGTATTAA CGATTGCGGA AATGCTTGTA TGGCCGGCGA TTCCAACGGT GGCAAGCGAG
CTCGCTCCTG CAGGGAAAGA AGGGTTTTAC CAAGGATTTG TCAACAGCAC GGCAACGGCA
GGGAGAATGA TCGGACCTGT GCTAGGCGGC TTTATCGCTG ATTACGCTGG CATGAAACCG
CTGTTTGCCG CGCTTGTTTT CTTTTGCGCG CTTTCGCTTG TCACGACGTT CTTTTATGAT
CGGAACTTGC CAAAGAAGCA AAACAAGGGG AAACAAACGG TTACAGTAGG GTAG
 
Protein sequence
MPRALWILMI GMAINVTGAS FLWPMNTIYV HEQLGKSLSV AGMVLMLNSG ASVVGNLAGG 
LLFDKIGGFK SMMIGIVATM SALVGLVFFH GWPHYAIFLT IIGMGSGIVF PVAAAYAGAI
WPEGGRRAFN GLYVAQNIGV AVGSALGGVV ASYSFTLIFL ANLLLYAVFC LLIVFGLRHV
RAVKAAKSKE AEAVKGEARA NWYALVTLCT GYFLCWISYV QWATTIAAYT QELHITLKQY
SLLWTINGAL IVFAQPFLSA VVQRWMRHVK RQMLVGFAIF VVSFSLLLQA NSFFEFALAM
VVLTIAEMLV WPAIPTVASE LAPAGKEGFY QGFVNSTATA GRMIGPVLGG FIADYAGMKP
LFAALVFFCA LSLVTTFFYD RNLPKKQNKG KQTVTVG