Gene BTH_II2140 details


Gene Information

Locus tag: BTH_II2140
Symbol:
ID: 3844539
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia thailandensis E264
Kingdom: Bacteria
Replicon accession: NC_007650
Strand:
Start bp: 2628827
End bp: 2630035
Gene Length: 1209 bp
Protein Length: 402 aa
Translation table: 11
GC content: 70%
IMG OID: 637839441
Product: hemin transport protein HmuS
Protein accession: YP_440328
Protein GI: 83717128
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG3720] Putative heme degradation protein
TIGRFAM ID:
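
The coordinate and length fields in this record can be cross-checked against each other. The sketch below assumes 1-based, inclusive coordinates and a CDS that ends in a single stop codon; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2628827
end_bp = 2630035

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with one stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # compare with the Gene Length field
print(protein_length, "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the listed Gene Length (1209 bp) and Protein Length (402 aa).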


Plasmid Coverage information

Num covering plasmid clones: 23
Plasmid unclonability p-value: 0.770349
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCGGCCCG CACGCTTTTT ATCCAGCCGG CTTGTCCGGC AATCGAGGAT CTCCGACATG 
ATGAACACCG CCGCTCCCGC CTCTTCGTCC GCCCGCGCGC TCGCACCCGG CGAACTGCGC
GACGCGTTCC TGCATCTGAA AGAAACCCGC AAGCTGCGCA ACCGCGACGT CGCGCAACTG
CTCGGCGTGA GCGAAGGCGA GGCGCTCGCC GCGTTCGCGG GCGAACGCGT CGTGCGGCTC
GAGCCGAGCT TCGTCGAGCT GTTCGAGGAG ATGCCGCGGC TCGGCAGCGT GATGGCGCTC
ACGCGCAACG CGGCCGCCGT GCACGAGAAG GACGGCGCGT TCGATCAGAT GAGCCACGAC
GGCCCGGTCG GCCTCGCGCT CGGCGCGATC GACCTGCGGA TCTTCTACCG CAACTGGGCG
GCGGGCTTCG CCGTCTACGA GCCGACCGCG CACGGCGTGA TGAAGAGCCT GCAGTTCTTC
GATGCGCAAG GCGACGCGGT GCACAAGGTC TACCTGCGCA AGCACAGCGA TCACGATGCA
TTCGACGCGT TCGCGTCGCG CTGGCGGATG CCCGTGCAAT CTCCGACGTT CGCGGCCGAG
CCCGCGCCGC GCGCGACCGT CGAGCGGCCC GACGCAGACG TCGACGCCGC GGGCCTGCGC
GCCGCATGGG ATGCGATGAC CGATACACAC CAGTTCCACG GCGTCGTGCG CCGCTTCGGC
GTGACGCGCA CGCAAGCGCT GCGGCTCGCC GGCGCGCCGC GCGCGCATCG CGTGACACCC
GACGCGACGC GGCGCGTGCT CGAGCGCGCC GCGCAAACGC GGCTGCCGAT CATGGTGTTC
GTCGGCAATC GCGGCATGAT CCAGATCCAT ACCGGCACCG TGACGAACAT CCGCCGCATG
GGCTCGTGGA TCAACGTGCT CGACGAGGAT TTCAACCTGC ATCTGCGCGA GGATCTCGTT
GCGTCCGCGT GGGCCGTGAA GAAACCGACG AGCGACGGCG TCGTCACGTC GGTGGAGCTG
TTCGATGCGG CGGGCGACAA CATCGCGATG CTGTTCGGCG CGCGCAAGCC CGGGCAGCCG
GAGCTCGCGG GCTGGCGCGA GCTGGTCGGC GCGCTGCCGA AAATCGATGC GGCGGATGCG
GCGAGCTCGG CGAACGCCGC GGATTCGATC GACGTGCACG GCTCGACCGA CGCCGAGGCC
GCGCGATGA
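
A GC content figure like the one reported for this gene can be recomputed from the sequence text. This is a minimal sketch, not the database's own method: the function name `gc_content` is hypothetical, and it assumes the sequence is pasted as plain text in which spaces and line breaks are layout only.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C among the A/C/G/T characters in seq.

    Whitespace and any other non-nucleotide characters are ignored, so the
    10-base blocks used in this record can be pasted in unchanged.
    """
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)


# Usage: pass the full gene sequence as one (possibly multi-line) string.
first_block = "ATGCGGCCCG CACGCTTTTT"
print(round(gc_content(first_block), 1))
```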
 
Protein sequence
MRPARFLSSR LVRQSRISDM MNTAAPASSS ARALAPGELR DAFLHLKETR KLRNRDVAQL 
LGVSEGEALA AFAGERVVRL EPSFVELFEE MPRLGSVMAL TRNAAAVHEK DGAFDQMSHD
GPVGLALGAI DLRIFYRNWA AGFAVYEPTA HGVMKSLQFF DAQGDAVHKV YLRKHSDHDA
FDAFASRWRM PVQSPTFAAE PAPRATVERP DADVDAAGLR AAWDAMTDTH QFHGVVRRFG
VTRTQALRLA GAPRAHRVTP DATRRVLERA AQTRLPIMVF VGNRGMIQIH TGTVTNIRRM
GSWINVLDED FNLHLREDLV ASAWAVKKPT SDGVVTSVEL FDAAGDNIAM LFGARKPGQP
ELAGWRELVG ALPKIDAADA ASSANAADSI DVHGSTDAEA AR