Gene Cphamn1_1823 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCphamn1_1823 
Symbol 
ID6375514 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides BS1 
KingdomBacteria 
Replicon accessionNC_010831 
Strand
Start bp1977420 
End bp1978613 
Gene Length1194 bp 
Protein Length397 aa 
Translation table11 
GC content53% 
IMG OID642684320 
Productmajor facilitator superfamily MFS_1 
Protein accessionYP_001960222 
Protein GI189500752 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.536913 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACAGA ACAGAGAAGC TGTCATACGG ACCGTTGTTC CCGGCTCTTT AACCGCCTTT 
TCAACGGTGA TAAGTGATAA CCAACAGGAT TGCATGTCAG GTAAGTCAGG CAGGAGTACC
GTGAAATTTG CAGAATCTGT CGCGCTTATC GCGATGATGA TGTCTCTCGC CGCGCTTTCT
ATCGATGCCA TGCTCCCTGC CCTTCCTGCG ATCGGCCGTG AACTTGGCGT CCTGCAGGAA
AACACCAACC AGCTTGTCAT CTCTCTCTTG TTTCTCGGCA TGTCGGCCGG ACAGATTCTT
TACGGTCCGA TGTCTGATTC AGCCGGCAGA AAGCCCGCGA TCTATACCGG TTTCGGCATT
TTCATCACAG GAACGCTCTT CTGCCTCTTC GCCACGAGCT TCACCATGAT GCTCTCCGGC
AGGATCCTTC AGGGGGTAGG AGCCGCAAGC ACCCGCATTG TTTCCATAGC AATCGTGCGT
GATCAATACG AGGGACCGAA AATGGCGCGC GTCATGTCTT TTGTCATGAC AATATTTATC
CTCATACCGA TTCTCGCACC TGCTCTCGGG CAAACCATGC TTAACGCATC TGGATGGAGA
GCCATCTTCG GTATCTTACT GTTTCTCTCC CTCTTCACGC TCGCCTGGTT CTCATTACGC
CAGCCAGAGA CCCTGAGCAG GGAAAAACGC ATCCCGTTTA CCATCAAAAG AATAGTGACA
GCCATCCGTG AAGTACTGGG TATTCGACAG TCATTAGCCT ATACCATCAT TTCAGGCCTC
GTCTTCGGTT CTTTTCTCGG ATACCTGAAC TCCTCTCAGC AGATCCTGCA GATACAGTAT
GGACTCGGAG AAGATTTCCC GCTCTACTTC GGCATACTTG CCACCGCTTT CGGTGCAGCG
ACCCTGCTGA ACTCAAAACT CGTCATGCTC TTCAGAATGC ACTCTCTCGT CCATCATGCG
ATGCACGCCC TTGCCGTGCT CTCCGGGTTG TTTCTCGTCG CTGCAATGAC GCAAAACGGG
CACCCTCCCC TCTGGGCTTT CCTCATCTAC CTTCTGCCTG TTTTTTTACC ATCGGCATCC
TGTTCGGAAA CCTCAACACT CTGGCAATGG AACCACTCGG GCACATTGCG GGTATCGGGG
CATCGACAAT CGGCTCCCTC TCGACCTTCC TCGCGTTGTC AGTCGGTACG GTGA
 
Protein sequence
MPQNREAVIR TVVPGSLTAF STVISDNQQD CMSGKSGRST VKFAESVALI AMMMSLAALS 
IDAMLPALPA IGRELGVLQE NTNQLVISLL FLGMSAGQIL YGPMSDSAGR KPAIYTGFGI
FITGTLFCLF ATSFTMMLSG RILQGVGAAS TRIVSIAIVR DQYEGPKMAR VMSFVMTIFI
LIPILAPALG QTMLNASGWR AIFGILLFLS LFTLAWFSLR QPETLSREKR IPFTIKRIVT
AIREVLGIRQ SLAYTIISGL VFGSFLGYLN SSQQILQIQY GLGEDFPLYF GILATAFGAA
TLLNSKLVML FRMHSLVHHA MHALAVLSGL FLVAAMTQNG HPPLWAFLIY LLPVFLPSAS
CSETSTLWQW NHSGTLRVSG HRQSAPSRPS SRCQSVR