Gene Cphamn1_1524 details

Gene Information

Locus tag: Cphamn1_1524
Symbol:
ID: 6375202
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides BS1
Kingdom: Bacteria
Replicon accession: NC_010831
Strand:
Start bp: 1646882
End bp: 1648159
Gene Length: 1278 bp
Protein Length: 425 aa
Translation table: 11
GC content: 49%
IMG OID: 642684017
Product: major facilitator superfamily MFS_1
Protein accession: YP_001959931
Protein GI: 189500461
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
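
The start, end, gene length, and protein length fields in this record can be cross-checked against each other. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the CDS ends in a stop codon that is not counted in the protein length. The function names are hypothetical and not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming its final codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(1646882, 1648159)
print(length_bp, protein_length(length_bp))  # 1278 425
```

Under those assumptions the computed values agree with the Gene Length (1278 bp) and Protein Length (425 aa) fields.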


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 21
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAAAT CACCTTTGGC AATATTGTTT TTGACAGTAC TGCTTGACCT TATCGGGTTC 
GGTATTGTCC TGCCTCTTCT TCCTACCTAC GCCAAAGATC TCGGAGCGAG CCCTTTGATG
ATCGGCTTTA TTGCTGCCAC CTATTCCGGA ATGCAGTTTC TTTTTTCACC CATCTGGGGA
AGGTTGAGCG ACTTCATCGG ACGAAGACCG GTGATGCTGG TCAGTATCTT TATGGCCGCA
GTTTCCTATC TTTTTTTCGC CCATGCGTCA ACCATCCCGT TACTGATACT GGCACGTGCT
CTTTCCGGCA TCGGATCGGC GAACATCGCG GCTGCCCAGG CCTACATCAC CGACGTTACC
GACAGCAAAA GCAGATCTAC CGCCATGGGG ATGCTCGGGG CCGCTTTTGG AATCGGTTTT
ATTATCGGAC CGTTAATCGG CGGCTTTCTG AAGCACAATT TCGGCATTGA AATGGTCGGC
TACGTCGCTT CCGCGCTGAT AGCTCTGGAC TTTATCCTTG CTGTTTTTTT CCTTACGGAA
TCAAACAAAG ATGCGCAGAA AATCTCTCAC TTTCTCAAGA TTTCACTGGC ACGCACAGGC
AAACCGATAC TGACCTCGAT GCAAGAGAAA TCAGCCGCCT ATTTTAAGGG TATCAGTAAC
GCGCTAAGCC ACAAGCCCAT AGCACTGCTC ATGAGCGCCA ATTTTATTTT CACTTTTGGC
ATCATCAACA TGCAGATAGC GGCCATTCTT CTGTGGAAAG AATATTTCAT GGCAACCGAC
CAGCAGATAG GCTACCTATT CGCATACGTA GGTTTCATAT CGGTAGTTGT CCAGGGGGGC
CTGATCGGAA AATTGAATAA GCGATTCGGT GAACACAAGC TTTTCCTCTT GGGGCATATC
ATTACCTTTG TGGGTGTTTT TTTTATCCCG TTCATCCCTC CAACGACGCT GTTTACTCTT
GGGCTGGGAA TCCTTCTGTT CTTCTCAATA GGAACAAGCC TCGTAAACCC GATCAACATT
TCATTGATCT CCCTCTACAG CTACACCCAG AAACAGGGAC AGATCATGGG CTACGGACAG
TCCGTCAATT CCCTTGCCCG GATATTAGGC CCGTTCAGCG GCAGCATTCT TTATGGAATG
ATCCCTTCCA TGCCGTTTGT CGTGGCAGGG GTGCTCATGC TCGTCGGAAC AATAATTTCC
CTTAGCTTGT TTAAATACGA CATAGAGGCT TTGGAGCCGG AACCCGAATC CTCGACTGAC
CCGCAAACAG CTGAATAG
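
The GC content field can be recomputed from the gene sequence as printed. The sketch below is one way to do it, assuming the sequence is pasted as a string that still contains the 10-base block spacing and line breaks. The function name `gc_content` is hypothetical, and the database's own rounding rule is not stated in this record.

```python
def gc_content(seq: str) -> float:
    """Percentage of G and C among the A/C/G/T letters of seq."""
    # Keep only nucleotide letters, which drops block spaces and newlines.
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(b in "GC" for b in bases)
    return 100.0 * gc / len(bases)


# First two 10-base blocks of the gene sequence above.
print(gc_content("ATGAAGAAAT CACCTTTGGC"))  # 40.0
```

Passing the full gene sequence to the same function gives a value to compare with the GC content field.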
 
Protein sequence
MKKSPLAILF LTVLLDLIGF GIVLPLLPTY AKDLGASPLM IGFIAATYSG MQFLFSPIWG 
RLSDFIGRRP VMLVSIFMAA VSYLFFAHAS TIPLLILARA LSGIGSANIA AAQAYITDVT
DSKSRSTAMG MLGAAFGIGF IIGPLIGGFL KHNFGIEMVG YVASALIALD FILAVFFLTE
SNKDAQKISH FLKISLARTG KPILTSMQEK SAAYFKGISN ALSHKPIALL MSANFIFTFG
IINMQIAAIL LWKEYFMATD QQIGYLFAYV GFISVVVQGG LIGKLNKRFG EHKLFLLGHI
ITFVGVFFIP FIPPTTLFTL GLGILLFFSI GTSLVNPINI SLISLYSYTQ KQGQIMGYGQ
SVNSLARILG PFSGSILYGM IPSMPFVVAG VLMLVGTIIS LSLFKYDIEA LEPEPESSTD
PQTAE
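
The protein sequence corresponds to a translation of the gene sequence using translation table 11. The sketch below illustrates that relationship under a few assumptions: it uses the standard codon-to-amino-acid assignments, which translation table 11 shares with table 1 (the two differ in permitted start codons), it reads the sequence in frame from the first base, and it stops at the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon assignments in TCAG order; shared by translation tables 1 and 11.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    # Keep only nucleotide letters, which drops block spaces and newlines.
    dna = "".join(b for b in seq.upper() if b in "ACGT")
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":  # stop codon ends the protein
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 18 bases of the gene sequence above.
print(translate("ATGAAGAAAT CACCTTTGGC AATATTGTTT"))  # MKKSPLAILF
print(translate("CCGCAAACAG CTGAATAG"))  # PQTAE
```

The two fragments reproduce the first ten and last five residues of the protein sequence. This gene starts with ATG, so the alternative start codons allowed by table 11 do not come into play in this sketch.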