Gene BTH_II1891 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBTH_II1891 
Symbol 
ID3845344 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia thailandensis E264 
KingdomBacteria 
Replicon accessionNC_007650 
Strand
Start bp2290614 
End bp2291684 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content69% 
IMG OID637839192 
Productpentapeptide repeat-containing protein 
Protein accessionYP_440085 
Protein GI83716702 
COG category[S] Function unknown 
COG ID[COG1357] Uncharacterized low-complexity proteins 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones26 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAAGA TCCGCTCGGC GGTGCCGCCG CCGCCGCTGC CCGAGATCGT CGAGGGGCAG 
CGCTACGTGT CTGCGCAGCG CGACGTCGCG CTCGCCGACA CGCTGTTCGT CGATTGCCAC
TTCGAGCGCG TCGAATGGAC CGGCTGCCGG CTGTCGAACC TGCGCTTCGT GAACTGCACG
TTCGATGCGA ACCGCTTCGA TCGATGCGAG CTCGAGAAGC TCTCGTACGA ATCGAGCCGG
GTCCGCGAGG GCGCGTGGAC GCAAAGCGCG TTGCAGCGCG TGTCGTTCAA CGAGTGCGAG
ATCGACGGGG GCGCGTGGGC GGGCTGCCTG CTGAAGGACG TCGTGTGCTC GCAGTCGAAG
GGCGGCGCCT GGACGTTCGA CGCCGTGCGC GGCGCGCACG TGTCGCTCGT CGCGGGCGAA
TACGCGGGCG TCACGCTGCG CGGCGGCCAC TGGAGCGACA CGTCGTGGAT CGGCAGCCGG
CTCGTCGATC TGCGGCTCGA ATCGGTCGGG CTCGAGAATC TGATCGCCGG GCAAAGCGGC
TTCGAGCGCG CGGTGCTCGT CGAATGCCGC GGGATCAACG TACGCTGGAT CGATTCCCGG
ATCGAGCGGA TGACCGTTCA AGGCTGCGAG CTGAAGCAGG CTGCCTGGTC GCACAGCACA
TGGGCGACGG GCGAGATCCA CGCGAGCCGG CTGCCGATCG CGAGCTTCGA TCACGCGAGC
GTCAACGGCC TGACGGTGAC GAACAGCGAA TTGCCGCAGG CGATCTTCGA CAGCGCGAGC
GTGGCGGACA GCGCGCTGCA AGGCGTGCGC GCGCCGCGCA TCGCGTTGCG CGACGCATGG
CTCACGCGCG TGAACCTCGC GGGCGCGCAA TTGCAGCAGC TCGACGCGCG CGGCGTGCGT
CTGGAGCGCG TCGACTTGCG CGGCGCCGAT TGCCGCAGCG GCAATCTGGT CGGCCAGCTT
CGCCAGACGT GGGCGGCGGC CGATACGCGG GACGCGGTTT TCGAGGAAGC CACGAGTGCC
GACGATCGGC TCTGGTGGCA GCGAGTGCAA CCCGGAGCAA GAGGAGTTTG A
 
Protein sequence
MSKIRSAVPP PPLPEIVEGQ RYVSAQRDVA LADTLFVDCH FERVEWTGCR LSNLRFVNCT 
FDANRFDRCE LEKLSYESSR VREGAWTQSA LQRVSFNECE IDGGAWAGCL LKDVVCSQSK
GGAWTFDAVR GAHVSLVAGE YAGVTLRGGH WSDTSWIGSR LVDLRLESVG LENLIAGQSG
FERAVLVECR GINVRWIDSR IERMTVQGCE LKQAAWSHST WATGEIHASR LPIASFDHAS
VNGLTVTNSE LPQAIFDSAS VADSALQGVR APRIALRDAW LTRVNLAGAQ LQQLDARGVR
LERVDLRGAD CRSGNLVGQL RQTWAAADTR DAVFEEATSA DDRLWWQRVQ PGARGV