Gene Information |
Locus tag | BTH_II2103 |
Symbol | |
ID | 3844471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia thailandensis E264 |
Kingdom | Bacteria |
Replicon accession | NC_007650 |
Strand | + |
Start bp | 2582408 |
End bp | 2583649 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637839404 |
Product | sodium:dicarboxylate symporter family protein |
Protein accession | YP_440291 |
Protein GI | 83717139 |
COG category | [C] Energy production and conversion |
COG ID | [COG1301] Na+/H+-dicarboxylate symporters |
TIGRFAM ID | |
|
|
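The coordinate and length fields above agree with each other if the start and end positions are read as 1-based, inclusive coordinates. That reading is an assumption, since the record does not state its coordinate convention. A minimal sketch of the arithmetic in Python, with function and variable names chosen here purely for illustration:

```python
# Values copied from the Gene Information table.
START_BP = 2582408
END_BP = 2583649


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, ASSUMING 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(length_bp: int) -> int:
    """Residues encoded by a CDS whose final codon is a stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # the stop codon is not translated


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1242
print(protein_length(length_bp))  # 413
```

Both printed values match the "Gene Length" and "Protein Length" rows of the record.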
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 0.151657 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACACTGG CCAATAAAAA AATTCCTTTG CCGCTACAGA TGATTCTCGG CCTGGCGCTC GGCGTGGCCT TCGGGCTGCT CGCGCCCGCC GCGAGCCGCG ACCTGGCCTT CATTTCGACG CTGTTCGGCC ACGCGATCAA GATGGTCGTG CTGCCGCTGA TCCTGCTGTC GGTGACGCTC GGCGCGTTCC GCGCGGGCAC GCAGCGCGGC CGGCTCGGCA AGACGGCCGC GTTCAGCCTC GTGTTCTTCG TGCTGATGAC GGTGATCGCC GCGTCGCTCG GCCTCGCGCT CAACTGGCTG TTCAGGCCCG GCATCGGCGC GAGCCATGCG CAGACGGCCG CGATGCCGGC GAATCTCGCA AGCGGCATCG ACTGGATGAA GTTCCTGACC GACATGATTC CGTCGAACAT CGTCGGCGCG CTCGCGGCCG GCAATTCGCT GCCGGTGCTC GTGTTCGGCG TGCTGCTCGG CTGCGCGCTC GCCGCCGTCG AGGATCGCGC GGCGCCGTTC GTCGCTGTCT GCGAATCGAT GCTCGCTGCG TTCTTCAAGA TGACCGAGTG GGTCGTGTCG CTGTCTCCGA TTGCGATCTT TGCTGCGATC GCGGTGCTGC TGTCGTCGAA AGGGCTCGCC GCGATGGCGC CGCTCGCGAA GCTGCTCGGC ATCGCATATC TCGGCATGGC GCTCCTTGCC GCATGGCTCA CGCTGATCGT CAAGCTCGCC GGCCATTCGC CGCGCGCGGT CGTGCGCAAG GTGAGCGAGC CGCTGATCCT CGGCTTCACG ACGCGCTCGT CCGAAATCAC GTTCCCCGTG CATCTGAAGA AGCTCACGGA GATGGGCGTG CCGTCGTCGG TCGCGTCGAC CGTGTTGCCG CTGTCGTACA TTTTCAATCG CGAAGGCGCG GTGCTCTACA CGGTGCTCGC GGTCTGCTAT CTCGCTGACG CATATCAGCT CGCGTGGAGC TGGCCGCTGA TGATCACGAT CGCAGTCCTG ACGATCATCA CGATCGACGG CGCGGCGAAC GTGCCGTCGG GCGCGGTCGT CGCGATCACG GTGATCCTCG CCGCGATCGG GCTGCCTGCC GATGCGGTGC TGCTGATTCT CGGCGTCGAC GCGTTCTTCG ACATGGGCCG CACCGCGCTG AACGTCTACG CGAGCACCGT CGCGACGACG CTCGCGAGCC GCATGTCCGG CGTCGCGCCC GAGATCGCGG AAGCCGCGAC GGCACACGCA TCGCGCGCTT GA
|
Protein sequence | MTLANKKIPL PLQMILGLAL GVAFGLLAPA ASRDLAFIST LFGHAIKMVV LPLILLSVTL GAFRAGTQRG RLGKTAAFSL VFFVLMTVIA ASLGLALNWL FRPGIGASHA QTAAMPANLA SGIDWMKFLT DMIPSNIVGA LAAGNSLPVL VFGVLLGCAL AAVEDRAAPF VAVCESMLAA FFKMTEWVVS LSPIAIFAAI AVLLSSKGLA AMAPLAKLLG IAYLGMALLA AWLTLIVKLA GHSPRAVVRK VSEPLILGFT TRSSEITFPV HLKKLTEMGV PSSVASTVLP LSYIFNREGA VLYTVLAVCY LADAYQLAWS WPLMITIAVL TIITIDGAAN VPSGAVVAIT VILAAIGLPA DAVLLILGVD AFFDMGRTAL NVYASTVATT LASRMSGVAP EIAEAATAHA SRA
|
| |
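The gene and protein sequences above can be cross-checked against each other. The sketch below is one way to do that in plain Python, not the method the database itself uses. It relies on the fact that NCBI translation table 11 assigns the same amino acids to codons as the standard code; the helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and the handling of alternative start codons (which table 11 permits) is deliberately left out because this gene starts with ATG.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order for codons TTT, TTC, TTA, TTG, TCT, ...
# ('*' marks a stop codon). Same codon assignments as the standard code.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base groups of the gene sequence in this record.
print(translate("ATGACACTGG CCAATAAAAA AATTCCTTTG"))  # MTLANKKIPL
```

The printed peptide matches the first ten residues of the protein sequence in the record. Passing the full gene sequence to `translate` and `gc_percent` would allow the same comparison against the complete protein and the reported GC content.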