Gene Cphamn1_0859 details

Gene Information

Locus tag: Cphamn1_0859
Symbol:
ID: 6374526
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 925632
End bp: 926948
Gene Length: 1317 bp
Protein Length: 438 aa
Translation table: 11
GC content: 52%
IMG OID: 642683367
Product: major facilitator superfamily MFS_1
Protein accession: YP_001959291
Protein GI: 189499821
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
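
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the coding sequence ends in a single stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 925632
end_bp = 926948
gene_length_bp = 1317
protein_length_aa = 438

# Assumption: coordinates are 1-based and inclusive at both ends.
span_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is excluded
# from the reported protein length.
codon_count = span_bp // 3
expected_protein_aa = codon_count - 1

print(span_bp, span_bp % 3, expected_protein_aa)  # prints: 1317 0 438
```

Under these assumptions the span matches the reported gene length, is a whole number of codons, and yields the reported protein length.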


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.221691
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCATACAG AAACAAGCAC CCGCAAAATG GGCCCGATTG AACTGGACCC GTCTATACAG 
CCCTACCATG CATGGACATT TTTCTATGCG GCTTTTTTTT CTATCGGCAT GATCACCTTT
CTGTCCATCG GACAGACATA TATTCTCAAC ATCCACCTCA ACATCCCCGT TTCCGAACAA
GGCGCAATCA GCGGAGACCT TGTTTTCTGG ACAGAAATTG TAACGCTGCT TTTTTTTATT
CCCGCAGGCA TCCTCATGGA CCGTATCGGC AGAAAACCGG TCTTCACTGC CGGCTTTCTG
CTCATGGCTC TTACCTATGC CCTCTACCCT TTCGCCTCCT CAGTCAATGA CTTGCTGTTA
TTTCGAATCA TCTATGCATT CGCAATCGTG GCCATCGCGG GAGCACTGTC GACCATTCTC
GTAGACTATC CTGCAGAACG TTCAAGAGGT AAAATGGTCG CCCTTATCGG GCTGCTCAAC
GGGCTCGGAA TCGTTATCAC AAACCAGTTT TTCGGCTCGC TTCCTGAAAT TCTGACCAGC
CGGGGGATCG GAAAAATCGA AGCCGGATTC ATCACTCATC TGAGCATAGC CGCCCTGGCA
ACAATCACCG CAGTTGTCTG TTCCATAGGG CTCAAGAAAG GAACGCCGGT TAAACGTAAC
GAGCGCCCTC CGCTCAAGGA ACTGTTCCAG AGCGGCTTCA CGGCAACAAA AAACCCCAGG
ATTCTCCTCT CCTACTCCGC TGCATTCATC GCAAGAGGCG ATCAGTCGAT CAATGGCACC
TTTCTGAGCC TCTGGGGCAT TACGGCAGGT CTTGCCATGG GAATGGAGTC AGGCGAAGCA
TTCAGAAAGG GTACCACCAT ATTCATCATC ACCCAGATCG CCGCCCTTCT CTGGGCTCCA
CTCATAGGGC CGGTCATCGA CCGTATCAAC CGGGTCAGTG CTCTCGCGCT CTGCATGTTT
CTGGCCATGA TCGGCAATCT TTCGGTGCTT CTTCTCGACA ACCCTTTCGA TCCGATCGGC
TATCTGGTCT TTATCCTGCT CGGCATCGGC CAGATCAGCG TCTTTCTCGG GGCGCAGTCA
CTTATCGGCC AGGAAGCCCC TAAGGCAAAA AGAGGCTCTG TCCTCGGGGC ATTCAATATC
AGCGGCGCAA TAGGCATTCT GATTATTGCA GCAACAGGAG GAAGGCTTTT TGACAGCATG
AGCCCGAAAG CCCCCTTTGT GATTGTGGGC ACCATCAATG CCCTGCTTGT ATTTTACAGC
CTCTATGTAA GAAAAATCTC TTCTGCTCAG AGGGAACCTC TGAGCAACCC GGAATGA
 
Protein sequence
MHTETSTRKM GPIELDPSIQ PYHAWTFFYA AFFSIGMITF LSIGQTYILN IHLNIPVSEQ 
GAISGDLVFW TEIVTLLFFI PAGILMDRIG RKPVFTAGFL LMALTYALYP FASSVNDLLL
FRIIYAFAIV AIAGALSTIL VDYPAERSRG KMVALIGLLN GLGIVITNQF FGSLPEILTS
RGIGKIEAGF ITHLSIAALA TITAVVCSIG LKKGTPVKRN ERPPLKELFQ SGFTATKNPR
ILLSYSAAFI ARGDQSINGT FLSLWGITAG LAMGMESGEA FRKGTTIFII TQIAALLWAP
LIGPVIDRIN RVSALALCMF LAMIGNLSVL LLDNPFDPIG YLVFILLGIG QISVFLGAQS
LIGQEAPKAK RGSVLGAFNI SGAIGILIIA ATGGRLFDSM SPKAPFVIVG TINALLVFYS
LYVRKISSAQ REPLSNPE
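
The relationship between the gene sequence and the protein sequence above can be illustrated with a short translation routine. This is a sketch only, not the tool used to produce the record. It assumes the sequence is pasted as shown (blocks of 10 bases separated by spaces), and it uses the standard codon assignments, which translation table 11 shares with table 1 for internal codons. Table 11 also allows alternative start codons, but this gene begins with ATG, so no special handling is included. The helper names `clean`, `gc_percent`, and `translate` are hypothetical.

```python
# Standard codon-to-amino-acid assignments in TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and upper-case the bases."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("ATGCATACAG AAACAAGCAC CCGCAAAATG"))  # prints: MHTETSTRKM
```

The printed residues match the first ten residues of the protein sequence. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein sequence and the reported GC content.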