Gene BTH_II1917 details


Gene Information

Locus tag: BTH_II1917
Symbol:
ID: 3844415
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia thailandensis E264
Kingdom: Bacteria
Replicon accession: NC_007650
Strand:
Start bp: 2326149
End bp: 2327186
Gene Length: 1038 bp
Protein Length: 345 aa
Translation table: 11
GC content: 73%
IMG OID: 637839218
Product: transglutaminase domain-containing protein
Protein accession: YP_440111
Protein GI: 83717161
COG category: [E] Amino acid transport and metabolism
COG ID: [COG1305] Transglutaminase-like enzymes, putative cysteine proteases
TIGRFAM ID:
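
The coordinates and lengths listed above are mutually consistent, which a short check can confirm. This is a minimal sketch that assumes the start and end positions are 1-based and inclusive (the record does not state the convention, but the reported gene length agrees with it); the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 2326149
end_bp = 2327186

# Assumption: coordinates are 1-based and inclusive,
# so the span covers end - start + 1 bases.
gene_length = end_bp - start_bp + 1

# The final codon is a stop codon and is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, "bp")     # 1038 bp
print(protein_length, "aa")  # 345 aa
```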


Plasmid Coverage information

Num covering plasmid clones: 29
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGGGATGA CGACGCGCTC GCCGGCGAAC CCGAGGGCGG GCGAACGCAC GCGCAAGGCG 
GCCGTGCGCG CGGAGCCGGC GCGCGGTGCG CTGCTGCGCG TCACGCACGA CACGCGCTAT
CGCTACGCGG CGCGCGTCGA GTCCGCGCAG CATCAGGCGC GCTTGCGGCC GCTCGGGACG
CCGCGGCAGC GCGTGCTCGA GTTCTCGCTC GAAATCGATC CGCGCGCCGA AGGGCTGGTT
GTCGATACCG ATTCGTTCGG CAACGAGCGC ACGTCGTTCG CGCTCAACCA GCCGCACGAA
GAACTGTTCG TGCGCAGCCG CAGCGTCGTG CGCATCACGC CGCCTGCGCT GTCGGCGGGC
AAGCGCGGCG AGCCGCCGCC CGCGATCGCC GCGCCGCGCG ACGGGAGCGC GAGCGCGTGG
GAAGCCGTGC GCGAGTGCCT GACGTTTCGC GCGGGACGGC CGTTCGATCC GGCGAGCGAA
TTCACGTTCG CGTCGCCGCA CGCCGCGTGC CATCCGGATC TCGCCGCCTA TGCGGCGCCG
AGCTTCACGC CGGGCCGGCC GCTCGTGCAG GGCGCGTGGG AGCTGATGCG CCGCATTCAC
GCGGATTTCG CGTATGCGCC GAACAGCACC GACGTCGGCA CGACCGCGCT CGACGCGCTC
GCGCTGCGCA AGGGCGTGTG CCAGGACTTC GCTCACGTGA TGATCGGCGC GCTGCGCTCG
CTCGGGCTCG CCGCGCGCTA CGTGAGCGGC TATCTGCTGA CGCAGCCGCC GCCGGGACAG
CCGCGCCTGA TCGGCGCGGA CGCGTCGCAT GCGTGGGTCG AGGTCTACGA TCCGGCCTGG
CCCGAGGACG GCGGCTGGCT GCAGCTCGAT CCGACCAACG ATCGCGCGCC CGGCGACGAT
TACGTGATGC TGTCGATCGG CCGCGATTAC GCGGACGTGA CGCCGTTGCG CGGCGTCATT
CGCGGCGGCG GCGCCGATCA GGTGCTGACG GTCGGCGTGA CGGTGGAGCC GCTCGATCCG
GCGTCGCGAT CCGAGTGA
 
Protein sequence
MGMTTRSPAN PRAGERTRKA AVRAEPARGA LLRVTHDTRY RYAARVESAQ HQARLRPLGT 
PRQRVLEFSL EIDPRAEGLV VDTDSFGNER TSFALNQPHE ELFVRSRSVV RITPPALSAG
KRGEPPPAIA APRDGSASAW EAVRECLTFR AGRPFDPASE FTFASPHAAC HPDLAAYAAP
SFTPGRPLVQ GAWELMRRIH ADFAYAPNST DVGTTALDAL ALRKGVCQDF AHVMIGALRS
LGLAARYVSG YLLTQPPPGQ PRLIGADASH AWVEVYDPAW PEDGGWLQLD PTNDRAPGDD
YVMLSIGRDY ADVTPLRGVI RGGGADQVLT VGVTVEPLDP ASRSE
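
The relationship between the gene sequence and the protein sequence can be spot-checked by translating a stretch of the DNA. The following is a rough sketch, not the database's own method: it builds the codon map for translation table 11 by hand (its codon-to-residue assignments match the standard code) and translates the first and last lines of the gene sequence. The last line starts in frame because the 1020 bases before it are a multiple of three. The helper name `translate` is hypothetical.

```python
# Build the codon-to-residue map in NCBI base order (T, C, A, G).
# Table 11 uses the same codon assignments as the standard code;
# it differs in which codons may act as start codons.
BASES = "TCAG"
RESIDUES = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: RESIDUES[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate in frame 0, ignoring whitespace, up to the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGGGATGA CGACGCGCTC GCCGGCGAAC CCGAGGGCGG GCGAACGCAC GCGCAAGGCG"
last_line = "GCGTCGCGAT CCGAGTGA"

print(translate(first_line))  # MGMTTRSPANPRAGERTRKA
print(translate(last_line))   # ASRSE
```

Both results agree with the start and end of the protein sequence listed above.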