Gene BTH_I2020 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_I2020 
Symbol 
ID3849130 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007651 
Strand
Start bp2288594 
End bp2289859 
Gene Length1266 bp 
Protein Length421 aa 
Translation table11 
GC content71% 
IMG OID637841689 
Producthypothetical protein 
Protein accessionYP_442544 
Protein GI83721011 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGACGAGT TGACACTCGG GTTGATCGGC GCGGGCGCCG TCGTGGTGGG CGGCGTCGTG 
GTCTACAACG CGTGGCAGGG CGCGAAGGTG CGCCGCAAGA TGCCGCGCCC GATGCCGACC
GAGGCGGCCG AGGCCGCCGC GCGGCACGAG CGCGATGACG ATGCGCCCTT CATCGAGCCC
GTGCGGCAGC CGGCGCGCCG CGAGGCCGCA GCGGGCGGCG CGTCGGACGC GCGAAGCGAA
GACGCGGTGC GCGTCGAGCC GACGTTCGGC GGCGCGGCGC CCGCCGATAT GCCGGCTGAC
TTGCAGGCCG AGGCGACCAT CGCGAACGGC GCGGTTGTTG CGCCCGCCGC CGAGGAGACG
GACGGCGAAG CGGCCGCGCC GGCTCCCGCG CACGACGAGC CGGTCGAACC GGTGCTGCCC
GCCGCGACGA CGATTTCCGC GGCGCCGCCC GCTATCGTCG ATCGCCGCAT CGACTGCATC
GTGCCGATCC GCCTCGCGAG CCCGCTTGCG GGCGACAAGA TCCTGCCCGC CGCGCAGCGG
CTGCGCCGCG CGGGCAGCAA GCCGGTTCAC ATCGAGGGCA AGCCGGACGG CGGCGACGCA
TGGGAGCTGC TGCAAAACGG CGTGCGCTAC GAAGAGCTGC GCGCGGCCGC GCAGCTCGCG
AATCGCAGCG GTCCGCTCAA CGAACTCGAG TTCTCCGAAT TCGTGACGGG CGTCCAGCAG
TTCGCGGACG CGATCGACGG CGCGCCGGAA TTCCCGGACA TGATGGAAAC GGTGTCGATG
GCGCGCGAGC TCGACGGCTT CGCCGCGCAA TGCGACGCGC AGTTGTCGAT CAACGTGATG
TCGGACGGCG CGCCGTGGTC GGCGAACTAC GTGCAGGCGG TCGCGTCGCA GGACGGGCTG
CTGCTGTCGC GCGACGGCAC GCGCTTCGTG AAGCTCGACG CCAAGCAGAA CCCCGTCTTC
ATGCTGCAGT TCGGCGACAC GAACTTCCTG CGGGACGATC TCACGTACAA GGGCGGCAAT
CTGATCACGC TCGTGCTCGA CGTGCCCGTC GCCGACGAGG ACATTCTGCC GTTCAGACTG
ATGTGCGACT ATGCGAAATC GCTGTCCGAG CGAATCGGCG CGCGCGTCGT CGACGATCAG
CGCCGGCCGC TGCCCGAATC GACGCTGCTC GCGATCGAGC AGCAGTTGAT GAAGCTGTAC
GCGCGGCTCG AGGAGGCGGG GATTCCGGCC GGCTCGCCCG TCACGCGGCG GCTGTTCAGC
CAGTAA
 
Protein sequence
MDELTLGLIG AGAVVVGGVV VYNAWQGAKV RRKMPRPMPT EAAEAAARHE RDDDAPFIEP 
VRQPARREAA AGGASDARSE DAVRVEPTFG GAAPADMPAD LQAEATIANG AVVAPAAEET
DGEAAAPAPA HDEPVEPVLP AATTISAAPP AIVDRRIDCI VPIRLASPLA GDKILPAAQR
LRRAGSKPVH IEGKPDGGDA WELLQNGVRY EELRAAAQLA NRSGPLNELE FSEFVTGVQQ
FADAIDGAPE FPDMMETVSM ARELDGFAAQ CDAQLSINVM SDGAPWSANY VQAVASQDGL
LLSRDGTRFV KLDAKQNPVF MLQFGDTNFL RDDLTYKGGN LITLVLDVPV ADEDILPFRL
MCDYAKSLSE RIGARVVDDQ RRPLPESTLL AIEQQLMKLY ARLEEAGIPA GSPVTRRLFS
Q