Gene Information |
Locus tag | Cpha266_0087 |
Symbol | |
ID | 4570654 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Chlorobium phaeobacteroides DSM 266 |
Kingdom | Bacteria |
Replicon accession | NC_008639 |
Strand | + |
Start bp | 103347 |
End bp | 104657 |
Gene Length | 1311 bp |
Protein Length | 436 aa |
Translation table | 11 |
GC content | 55% |
IMG OID | 639764689 |
Product | major facilitator transporter |
Protein accession | YP_910581 |
Protein GI | 119355937 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG2814] Arabinose efflux permease |
TIGRFAM ID | |
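The length fields above follow from the coordinates. With 1-based inclusive start and end positions, the gene spans end - start + 1 bases, and an unspliced CDS that ends in a stop codon encodes one residue fewer than its codon count. A minimal sketch of that arithmetic, assuming inclusive coordinates and a single terminal stop codon (the function names are illustrative and not part of the database):

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count of an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon except the final stop codon yields one residue.
    return cds_length_bp // 3 - 1


length = gene_length(103347, 104657)
print(length, protein_length(length))  # 1311 436
```

Both values agree with the Gene Length and Protein Length fields of this record.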
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAGCAACG CTGACGAATC GATAAACGGA AACGACAATT CCCCCGCAGA ACAGAGCCCG ACAACCCTTA AAGAGAAGGT TCTGTCGGCT TTTCCCGCAT TTCGGAGCCG AAATTTCAGA CTTTATTTTA TCGGGCAGAT CGTTTCTATG GTCGGCACCT GGCTGCAGAT GGTGGCGCAG GGCTGGCTTG TGCTTGAAAT GACCGGTTCG GCTTTCTGGG TGGGAGTGGC TGCAGCAGCA TCCTCGCTTC CGACCCTGTT TCTTTCGCTT ATCGGCGGCG TTATTGTTGA CCGGTACAAC AGGAAAACCA TTCTGCTCTG GACGCAGTCG GCTTCGATGG TGCTGGCGCT TGTGCTTGGT ATCATTACCC TTACCGGGTC GGTGACGCTT GCCGTTATTC TTGCGCTGGC GTTTCTTCTC GGCTGTGTTG CTGCGGTTGC GACACCTGCG ATTCAGGCAT TCCTGAGCGA AATGGTGGAT CGCGACCAGC TCCATTCCGC TGTAGCGCTC AATGCGGCTA TTTTCAATGC GTCGAGGGTT ATCGGTCCGG CCATTGCAGG GCTTATGATT GCATGGATCG GCACAGGTGG CGCATTCATC GCCAACGGGT TGAGTTATTT TGCTGTGATT GCCGCGCTGC TTGCCATAAC TATCGCAACT CCCCGCAAGA TACCTGCGGT GCATCAGCCT CCCTTGCAGT CGATCAGGGA CGGCATTGTC TATACGTGGG AGCACCCGGT CATCAGGACT ATCGTGATGT TCGTATCGGT GGTTTCGATA TTCGGATGGT CGTTCATGTC GATGCTGCCG GTGGTGGCCA AGCAGACTTA CGGTCTCGGT TCAGATGGCA TGGGTTACCT TTTTTCCGCA TTCGGACTTG GCTCCCTCTC AGGCACCGTG GTGGTCTCCA TGTCGTCGGG AAAGATCCGC AGCAGCGCCA TGGTGATCGG CGGTATCCTT GTTTTTTCTC TTGCTCTCAC GGCATTCACC TTTGCCTCGG ACGAACGTGT TGCGATGGCA TTTCTCTTTA TTGCAGGGAT CGGCATGCTT TCGGCTTTCG CCACCATGAC CGCTACGGTG CAGCGCCTCG TTGAGGACAG CTATCGTGGC CGGGTGATGA GCATTTACCT GATGGTGCTG ATGGGGTTTA TGCCGCTGGG CAACCTGCAG GTCGGGTTTC TTTCGGAGCA GTTCGGTACG GCTATTGCCA TAAGGATTGG CAGTATCGTC GTGCTGCTGG CAACCATTTT TCTTTTCAGC TATCGCAAAG AGATTCAGTC GGCCTGGCAT GAGTACCGGA TGCAGGAGTA G
|
Protein sequence | MSNADESING NDNSPAEQSP TTLKEKVLSA FPAFRSRNFR LYFIGQIVSM VGTWLQMVAQ GWLVLEMTGS AFWVGVAAAA SSLPTLFLSL IGGVIVDRYN RKTILLWTQS ASMVLALVLG IITLTGSVTL AVILALAFLL GCVAAVATPA IQAFLSEMVD RDQLHSAVAL NAAIFNASRV IGPAIAGLMI AWIGTGGAFI ANGLSYFAVI AALLAITIAT PRKIPAVHQP PLQSIRDGIV YTWEHPVIRT IVMFVSVVSI FGWSFMSMLP VVAKQTYGLG SDGMGYLFSA FGLGSLSGTV VVSMSSGKIR SSAMVIGGIL VFSLALTAFT FASDERVAMA FLFIAGIGML SAFATMTATV QRLVEDSYRG RVMSIYLMVL MGFMPLGNLQ VGFLSEQFGT AIAIRIGSIV VLLATIFLFS YRKEIQSAWH EYRMQE
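The sequences above are displayed in space-separated blocks of ten characters, so the whitespace has to be removed before any downstream use. The sketch below strips the blocks and recomputes a GC percentage as (G + C) / total length. The helper names are hypothetical, and how the database rounds its GC content field is not stated here, so the result may differ slightly from the listed value.

```python
def clean_sequence(display_text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(display_text.split()).upper()


def gc_percent(display_text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(display_text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# First two blocks of the gene sequence in this record.
first_blocks = "ATGAGCAACG CTGACGAATC"
print(len(clean_sequence(first_blocks)))  # 20
print(gc_percent(first_blocks))           # 50.0
```

Passing the full Gene sequence field to `gc_percent` gives the value to compare against the GC content field.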