Gene BTH_II2103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II2103 
Symbol 
ID3844471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp2582408 
End bp2583649 
Gene Length1242 bp 
Protein Length413 aa 
Translation table11 
GC content67% 
IMG OID637839404 
Productsodium:dicarboxylate symporter family protein 
Protein accessionYP_440291 
Protein GI83717139 
COG category[C] Energy production and conversion 
COG ID[COG1301] Na+/H+-dicarboxylate symporters 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.151657 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACACTGG CCAATAAAAA AATTCCTTTG CCGCTACAGA TGATTCTCGG CCTGGCGCTC 
GGCGTGGCCT TCGGGCTGCT CGCGCCCGCC GCGAGCCGCG ACCTGGCCTT CATTTCGACG
CTGTTCGGCC ACGCGATCAA GATGGTCGTG CTGCCGCTGA TCCTGCTGTC GGTGACGCTC
GGCGCGTTCC GCGCGGGCAC GCAGCGCGGC CGGCTCGGCA AGACGGCCGC GTTCAGCCTC
GTGTTCTTCG TGCTGATGAC GGTGATCGCC GCGTCGCTCG GCCTCGCGCT CAACTGGCTG
TTCAGGCCCG GCATCGGCGC GAGCCATGCG CAGACGGCCG CGATGCCGGC GAATCTCGCA
AGCGGCATCG ACTGGATGAA GTTCCTGACC GACATGATTC CGTCGAACAT CGTCGGCGCG
CTCGCGGCCG GCAATTCGCT GCCGGTGCTC GTGTTCGGCG TGCTGCTCGG CTGCGCGCTC
GCCGCCGTCG AGGATCGCGC GGCGCCGTTC GTCGCTGTCT GCGAATCGAT GCTCGCTGCG
TTCTTCAAGA TGACCGAGTG GGTCGTGTCG CTGTCTCCGA TTGCGATCTT TGCTGCGATC
GCGGTGCTGC TGTCGTCGAA AGGGCTCGCC GCGATGGCGC CGCTCGCGAA GCTGCTCGGC
ATCGCATATC TCGGCATGGC GCTCCTTGCC GCATGGCTCA CGCTGATCGT CAAGCTCGCC
GGCCATTCGC CGCGCGCGGT CGTGCGCAAG GTGAGCGAGC CGCTGATCCT CGGCTTCACG
ACGCGCTCGT CCGAAATCAC GTTCCCCGTG CATCTGAAGA AGCTCACGGA GATGGGCGTG
CCGTCGTCGG TCGCGTCGAC CGTGTTGCCG CTGTCGTACA TTTTCAATCG CGAAGGCGCG
GTGCTCTACA CGGTGCTCGC GGTCTGCTAT CTCGCTGACG CATATCAGCT CGCGTGGAGC
TGGCCGCTGA TGATCACGAT CGCAGTCCTG ACGATCATCA CGATCGACGG CGCGGCGAAC
GTGCCGTCGG GCGCGGTCGT CGCGATCACG GTGATCCTCG CCGCGATCGG GCTGCCTGCC
GATGCGGTGC TGCTGATTCT CGGCGTCGAC GCGTTCTTCG ACATGGGCCG CACCGCGCTG
AACGTCTACG CGAGCACCGT CGCGACGACG CTCGCGAGCC GCATGTCCGG CGTCGCGCCC
GAGATCGCGG AAGCCGCGAC GGCACACGCA TCGCGCGCTT GA
 
Protein sequence
MTLANKKIPL PLQMILGLAL GVAFGLLAPA ASRDLAFIST LFGHAIKMVV LPLILLSVTL 
GAFRAGTQRG RLGKTAAFSL VFFVLMTVIA ASLGLALNWL FRPGIGASHA QTAAMPANLA
SGIDWMKFLT DMIPSNIVGA LAAGNSLPVL VFGVLLGCAL AAVEDRAAPF VAVCESMLAA
FFKMTEWVVS LSPIAIFAAI AVLLSSKGLA AMAPLAKLLG IAYLGMALLA AWLTLIVKLA
GHSPRAVVRK VSEPLILGFT TRSSEITFPV HLKKLTEMGV PSSVASTVLP LSYIFNREGA
VLYTVLAVCY LADAYQLAWS WPLMITIAVL TIITIDGAAN VPSGAVVAIT VILAAIGLPA
DAVLLILGVD AFFDMGRTAL NVYASTVATT LASRMSGVAP EIAEAATAHA SRA