Gene Cpha266_0087 details


Gene Information

Locus tag: Cpha266_0087
Symbol:
ID: 4570654
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Chlorobium phaeobacteroides DSM 266
Kingdom: Bacteria
Replicon accession: NC_008639
Strand:
Start bp: 103347
End bp: 104657
Gene Length: 1311 bp
Protein Length: 436 aa
Translation table: 11
GC content: 55%
IMG OID: 639764689
Product: major facilitator transporter
Protein accession: YP_910581
Protein GI: 119355937
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2814] Arabinose efflux permease
TIGRFAM ID:
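
The coordinate and length fields above are related by simple arithmetic, which a short Python sketch can check. It assumes 1-based, inclusive coordinates and that the final codon is a stop codon not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino acids encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(103347, 104657)
print(length)                  # 1311, matching the Gene Length field
print(protein_length(length))  # 436, matching the Protein Length field
```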


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGCAACG CTGACGAATC GATAAACGGA AACGACAATT CCCCCGCAGA ACAGAGCCCG 
ACAACCCTTA AAGAGAAGGT TCTGTCGGCT TTTCCCGCAT TTCGGAGCCG AAATTTCAGA
CTTTATTTTA TCGGGCAGAT CGTTTCTATG GTCGGCACCT GGCTGCAGAT GGTGGCGCAG
GGCTGGCTTG TGCTTGAAAT GACCGGTTCG GCTTTCTGGG TGGGAGTGGC TGCAGCAGCA
TCCTCGCTTC CGACCCTGTT TCTTTCGCTT ATCGGCGGCG TTATTGTTGA CCGGTACAAC
AGGAAAACCA TTCTGCTCTG GACGCAGTCG GCTTCGATGG TGCTGGCGCT TGTGCTTGGT
ATCATTACCC TTACCGGGTC GGTGACGCTT GCCGTTATTC TTGCGCTGGC GTTTCTTCTC
GGCTGTGTTG CTGCGGTTGC GACACCTGCG ATTCAGGCAT TCCTGAGCGA AATGGTGGAT
CGCGACCAGC TCCATTCCGC TGTAGCGCTC AATGCGGCTA TTTTCAATGC GTCGAGGGTT
ATCGGTCCGG CCATTGCAGG GCTTATGATT GCATGGATCG GCACAGGTGG CGCATTCATC
GCCAACGGGT TGAGTTATTT TGCTGTGATT GCCGCGCTGC TTGCCATAAC TATCGCAACT
CCCCGCAAGA TACCTGCGGT GCATCAGCCT CCCTTGCAGT CGATCAGGGA CGGCATTGTC
TATACGTGGG AGCACCCGGT CATCAGGACT ATCGTGATGT TCGTATCGGT GGTTTCGATA
TTCGGATGGT CGTTCATGTC GATGCTGCCG GTGGTGGCCA AGCAGACTTA CGGTCTCGGT
TCAGATGGCA TGGGTTACCT TTTTTCCGCA TTCGGACTTG GCTCCCTCTC AGGCACCGTG
GTGGTCTCCA TGTCGTCGGG AAAGATCCGC AGCAGCGCCA TGGTGATCGG CGGTATCCTT
GTTTTTTCTC TTGCTCTCAC GGCATTCACC TTTGCCTCGG ACGAACGTGT TGCGATGGCA
TTTCTCTTTA TTGCAGGGAT CGGCATGCTT TCGGCTTTCG CCACCATGAC CGCTACGGTG
CAGCGCCTCG TTGAGGACAG CTATCGTGGC CGGGTGATGA GCATTTACCT GATGGTGCTG
ATGGGGTTTA TGCCGCTGGG CAACCTGCAG GTCGGGTTTC TTTCGGAGCA GTTCGGTACG
GCTATTGCCA TAAGGATTGG CAGTATCGTC GTGCTGCTGG CAACCATTTT TCTTTTCAGC
TATCGCAAAG AGATTCAGTC GGCCTGGCAT GAGTACCGGA TGCAGGAGTA G
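
The GC content reported for this gene (55%) can be compared against the sequence itself. The following is a minimal Python sketch, assuming the sequence is pasted in the space-grouped layout shown above; `clean_sequence` and `gc_content` are hypothetical helper names, and only the first line of the gene sequence is used for illustration.

```python
def clean_sequence(block: str) -> str:
    """Strip the spaces and line breaks used to group bases in the record."""
    return "".join(block.split()).upper()


def gc_content(block: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(block)
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence only; paste the full block to check the whole gene.
first_line = "ATGAGCAACG CTGACGAATC GATAAACGGA AACGACAATT CCCCCGCAGA ACAGAGCCCG"
print(len(clean_sequence(first_line)))
print(round(gc_content(first_line), 1))
```

Running `gc_content` on the complete gene sequence and rounding to a whole percent gives a value that can be compared with the GC content field.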
 
Protein sequence
MSNADESING NDNSPAEQSP TTLKEKVLSA FPAFRSRNFR LYFIGQIVSM VGTWLQMVAQ 
GWLVLEMTGS AFWVGVAAAA SSLPTLFLSL IGGVIVDRYN RKTILLWTQS ASMVLALVLG
IITLTGSVTL AVILALAFLL GCVAAVATPA IQAFLSEMVD RDQLHSAVAL NAAIFNASRV
IGPAIAGLMI AWIGTGGAFI ANGLSYFAVI AALLAITIAT PRKIPAVHQP PLQSIRDGIV
YTWEHPVIRT IVMFVSVVSI FGWSFMSMLP VVAKQTYGLG SDGMGYLFSA FGLGSLSGTV
VVSMSSGKIR SSAMVIGGIL VFSLALTAFT FASDERVAMA FLFIAGIGML SAFATMTATV
QRLVEDSYRG RVMSIYLMVL MGFMPLGNLQ VGFLSEQFGT AIAIRIGSIV VLLATIFLFS
YRKEIQSAWH EYRMQE
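
The protein sequence corresponds to the gene sequence translated with translation table 11. The following is a minimal Python sketch of that translation, assuming the amino acid assignments of table 11 match the standard code and that the gene starts with ATG (as it does here), so alternative start codons are not handled. `translate` and the table constants are illustrative names, not part of the record.

```python
from itertools import product

# Codons enumerated in TCAG order, with the third base varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(block: str) -> str:
    """Translate a space-grouped nucleotide block, stopping at the first stop codon."""
    seq = "".join(block.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAGCAACG CTGACGAATC GATAAACGGA"))  # MSNADESING
# Last 12 bases of the gene sequence, ending in the TAG stop codon.
print(translate("ATGCAGGAGTAG"))                      # MQE
```

Both outputs agree with the start (`MSNADESING`) and end (`...MQE`) of the protein sequence listed above.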