Gene Cpha266_1111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCpha266_1111 
Symbol 
ID4570151 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameChlorobium phaeobacteroides DSM 266 
KingdomBacteria 
Replicon accessionNC_008639 
Strand
Start bp1255528 
End bp1256862 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content47% 
IMG OID639765708 
Productmajor facilitator transporter 
Protein accessionYP_911576 
Protein GI119356932 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2814] Arabinose efflux permease 
TIGRFAM ID[TIGR00880] Multidrug resistance protein 


Plasmid Coverage information

Num covering plasmid clones23 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTTTGTT ATCTTGGGAG CTTTAGCGCA ATAATAGAAT CTGCTTGCAT GAAGAAGTCC 
CCACTGGTCA TCCTTCTCTT AACGGTACTG CTTGATTTGA TCGGCTTTGG CATTGTACTG
CCGCTTCTTC CGACTTATGC AAAAGACCTT GGTGCAAGCC CCCTGATGAT TGGGCTGATT
GCTGCGATTT TCTCCATCAT GCAGTTCATC TTCTCCCCGC TCTGGGGCAA ACTGAGCGAT
AAAATCGGTC GCAGACCGGT TATGCTCATC AGTATTTTCA TTACCGCCGT CTCCTATTTT
GTTTTTTCAC AGGCCGTTAC CATCCCTCTT CTTATTTTTG CCAGAGGTCT TTCCGGAATA
GGATCAGCCA ATATTGCCGC TGCCCAGGCA TACATCACCG ATGTCACCGA CAATCAAAAC
CGGTCCAAAG CCATGGGAAT GATAGGTGCG GCTTTCGGCA TCGGATTTAT TATCGGCCCA
TTGATCGGTG GCCTGCTCAA GCATAACTAC GGCATTGCTA TGGTAGGTTA TGTCGCATCA
GCTCTGATTA CTCTTGACTT TATTCTGGCG ATTTTCCTCT TGCCGGAATC CAATAAACAT
GCGATAAAAT TCAATTTCGG GTTTCTGAAA GAGAAGTCAG GAGCCGGTGC TTCAAATGAA
AAGCCAACGA GCTCGTCTGG CAATAAAATG CAAGCCTACA TTGACGGCCT TAAACTTGCT
TTCACCTCCC GACCACTTGC CCTCCTGATG ATTGCCAACT ATGTCTTCAC CTTTGCCATC
GTCAATATGC AGGTAGCCTC GATTCTACTT TGGAAAGAGT ATTTTCATGC TTCCGATCAG
GCTATCGGCT ATCTCTTCGC TTATGTGGGA TTCTTTTCTG TCGTTGTCCA GGGCGGCCTG
ATAAGCAAAC TGATCAAGGC GCTTGGCGAA CACAAGCTGT TTTTCTGGGG TCATCTTTTT
ACCTTTGTAG GGGTCTTTTT TATCCCTTTT CTGCCTTCAG ATACCCTCTT TTCGTTCGGA
CTGTTCATTC TGTTTTTCTT CGCAATCGGA ACAAGCCTGG TGGCGCCCAT AAACATCTCG
CTCATCTCGC TCTATACTTA CAAACAGAAG CAGGGGGAAA TCCTCGGACT GTCGCAATCC
ATCAACTCGT TTGCACGCAT TATGGGCCCT TTCAGCGGCA GCGTTCTCTA TGGCCTGAAC
GTCCACGCGC CTTATATCCT TGCCGGCGTG CTGACGTTGT TTGGCGCAGT GATTTCTCTC
ATGCTGTTCA AGTATAAAAT AGATGCTCTG GATCCCGATC TGGACACACA GCCATCTTGG
TCAAACAAGG ATTAA
 
Protein sequence
MFCYLGSFSA IIESACMKKS PLVILLLTVL LDLIGFGIVL PLLPTYAKDL GASPLMIGLI 
AAIFSIMQFI FSPLWGKLSD KIGRRPVMLI SIFITAVSYF VFSQAVTIPL LIFARGLSGI
GSANIAAAQA YITDVTDNQN RSKAMGMIGA AFGIGFIIGP LIGGLLKHNY GIAMVGYVAS
ALITLDFILA IFLLPESNKH AIKFNFGFLK EKSGAGASNE KPTSSSGNKM QAYIDGLKLA
FTSRPLALLM IANYVFTFAI VNMQVASILL WKEYFHASDQ AIGYLFAYVG FFSVVVQGGL
ISKLIKALGE HKLFFWGHLF TFVGVFFIPF LPSDTLFSFG LFILFFFAIG TSLVAPINIS
LISLYTYKQK QGEILGLSQS INSFARIMGP FSGSVLYGLN VHAPYILAGV LTLFGAVISL
MLFKYKIDAL DPDLDTQPSW SNKD