Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Bcep18194_A3705 |
Symbol | araH |
ID | 3748889 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia sp. 383 |
Kingdom | Bacteria |
Replicon accession | NC_007510 |
Strand | + |
Start bp | 593996 |
End bp | 595012 |
Gene Length | 1017 bp |
Protein Length | 338 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637761985 |
Product | L-arabinose transporter permease protein |
Protein accession | YP_367950 |
Protein GI | 78065181 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG1172] Ribose/xylose/arabinose/galactoside ABC-type transport systems, permease components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 20 |
Plasmid unclonability p-value | 0.256894 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.416608 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCAAGTCA ACGAAAACCT TGGCAGCGCC GCCGTGAAGC CGTCGGCCGA CGCGCTGGTG CCGCAGCAGA GCGATCGCCA GAAGTGGTGG CAGCATCTGA CCGAATACAG CCTGATCGCG ATCTTCGCGG TGATGTTCAT CACGATGTCG CTCACCGTCG ATCACTTCTT CTCGATCGAC AACATGCTCG GCCTCGCGCT GTCGATCTCG CAGATCGGGA TGGTCGCGTG CACGATGATG TTCTGTCTCG CGTCGCGCGA CTTCGACCTG TCGATCGGCT CGACCGTCGC GTTCTCCGGC GTGCTGTGCG CGATGGTGCT GAACGCGACC GACAACACGT TCGTCGCGAT CATCGCGGCG GTCGCGGCCG GCGCCGCGAT CGGCTTCGTG AACGGCGCGG TGATCGCATA CCTGCGCATC AACGCGCTGA TCACCACGCT CGCGACGATG GAGATCGTGC GCGGGCTCGG CTTCATCGTG TCGAAGGGGC AGGCGGTCGG CGTGTCGTCG GATACGTTCA TCGCGCTCGG CGGGCTGTCG CTGTTCGGCG TGTCGCTGCC GATCTGGGTC ACGCTGCTGT GCTTCATCGC GTTCGGCGTG CTGCTGAACC AGACGGTATA CGGCCGCAAC ACGCTCGCGA TCGGCGGTAA CCCGGAAGCG TCGCGGCTCG CGGGGATCAA CGTCGAACGC ACGCGCGTGT ACATCTTCCT GATCCAGGGC GCGGTGACGG CGCTCGCGGG CGTGATCCTC GCGTCGCGCA TCACGTCGGG CCAGCCGAAC GCCGCGCAGG GCTTCGAGCT GAACGTGATC TCGGCGTGCG TGCTCGGCGG CGTGTCGCTG ATGGGCGGCC GCGCGACGAT CTCGGGCGTC GTGATCGGCG TGCTGATCAT GGGCACCGTC GAGAACGTGA TGAACCTGCT GAACATCGAC GCGTTCTACC AGTACCTCGT GCGCGGCGCG ATCCTGCTCG CGGCCGTGCT GCTCGACCAG TTGAAGAACC GCGGCGTCCG CGACTGA
|
Protein sequence | MQVNENLGSA AVKPSADALV PQQSDRQKWW QHLTEYSLIA IFAVMFITMS LTVDHFFSID NMLGLALSIS QIGMVACTMM FCLASRDFDL SIGSTVAFSG VLCAMVLNAT DNTFVAIIAA VAAGAAIGFV NGAVIAYLRI NALITTLATM EIVRGLGFIV SKGQAVGVSS DTFIALGGLS LFGVSLPIWV TLLCFIAFGV LLNQTVYGRN TLAIGGNPEA SRLAGINVER TRVYIFLIQG AVTALAGVIL ASRITSGQPN AAQGFELNVI SACVLGGVSL MGGRATISGV VIGVLIMGTV ENVMNLLNID AFYQYLVRGA ILLAAVLLDQ LKNRGVRD
|
| |