Gene BURPS1710b_1680 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1710b_1680 
Symbol 
ID3691521 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1710b 
KingdomBacteria 
Replicon accessionNC_007434 
Strand
Start bp1799308 
End bp1800558 
Gene Length1251 bp 
Protein Length416 aa 
Translation table11 
GC content59% 
IMG OID637728136 
ProductHK97 family phage portal protein 
Protein accessionYP_333083 
Protein GI76809174 
COG category[S] Function unknown 
COG ID[COG4695] Phage-related protein 
TIGRFAM ID[TIGR01537] phage portal protein, HK97 family 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value0.905165 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGCCCATTT CGCTGACCGA CGGGAGCTTC TGGTCCGCGT GGGGCGGTAT GGGGTCATCG 
AGCGGAGAGA CGGTGACGGC CGATTCGGCA CTTCAGCTAT CTGCGGTGTG GTCGTGTGTC
CGTCTGATCG CGGAAACAAT CGCGACTCTT CCGTTGAATC TCTATCAGAC CAAGCCAGAC
GGAACGCGTG TTCTCGCGAA GCAACACCGG CTGTACACGG TCATCCATTC TCAGCCAAAC
GCAGAGAACA CTGCGGCCGA GTTCTGGGAA GTGATCGTCG CGAGCATGCT GCTATGGGGG
AATGGGTACG CGAGAAAGCT CCGGCCGGCG GGTGTGCTCA TCGGCCTTGA GCTGATGCTG
CCACAGCGTA CGACTGTGAA GCGCCTCACA AGCGGAGCGT TGCAATACAC CTATCGCAAC
GTCGATGGAA CTGTCAGCAC GCTGGCCGAG GACGATGTGT TTCACGTTCG AGGGTTCAGT
CTCGATGGCT TGATGGGTCT TACGCCGATT CAATACGCAC GTGAGGTTCT TGGGAATTCG
ACGGCCGCGA ATAAGACGAG CGCGAGCGTC TTTCGGAATG GGTTGCGACC ATCAGGTGTG
CTCTCGACCG ACCAGATCCT CCAGAAAGAA AAGCGTGCGG AGATTCGAAC GGATCTAGCA
GAGCAGTTTG GCGGCGCCAT GCAGGCCGGG AAAACGATGG TGCTGGAAGC CGGGATGAAG
TACCAGGCCA TCACGATGAA TCCCGGTGAT GTCCAGTTGC TGGAGACGCG GGCATTCAAC
ATCGAGGAAA TCTGCCGCTG GTATCGCGTT CCGCCGTTTA TGGTCGGCCA CAGCGAGAAA
TCGACAAGCT GGGGAACTGG GATCGAACAA CAGACGCTCG GCTTTTTGAC ATTCACCCTG
CGGCCTTGGT TGACGCGGAT TGAACAGGCA GCGCGACGGT CCCTGCTGAG GCCGGGAGAG
CGCGATCAGT TTTATGCGGA GTTCTCCGTC GAAGGGCTGT TGCGAGCCGA TAGTGCAGGC
CGAGCGGCGT TCTATTCAAC GATGACCCAA AACGGCCTGA TGACGCGTGA CGAATGTCGG
GCGAAGGAAA ACCTGCCGCC GATGGGTGGC AATGCAGCGG TGTTGACGGT TCAGTCGGCA
TTGCTCCCAA TCGACAAGCT CGGTGAGCAC ACGACGGCTA CGGCTGCGCA GGACGCCTTG
AAAGCGTGGC TCTACCAGGA GGAAAAAACA CGTGCAACGC AAGAACGGTA A
 
Protein sequence
MPISLTDGSF WSAWGGMGSS SGETVTADSA LQLSAVWSCV RLIAETIATL PLNLYQTKPD 
GTRVLAKQHR LYTVIHSQPN AENTAAEFWE VIVASMLLWG NGYARKLRPA GVLIGLELML
PQRTTVKRLT SGALQYTYRN VDGTVSTLAE DDVFHVRGFS LDGLMGLTPI QYAREVLGNS
TAANKTSASV FRNGLRPSGV LSTDQILQKE KRAEIRTDLA EQFGGAMQAG KTMVLEAGMK
YQAITMNPGD VQLLETRAFN IEEICRWYRV PPFMVGHSEK STSWGTGIEQ QTLGFLTFTL
RPWLTRIEQA ARRSLLRPGE RDQFYAEFSV EGLLRADSAG RAAFYSTMTQ NGLMTRDECR
AKENLPPMGG NAAVLTVQSA LLPIDKLGEH TTATAAQDAL KAWLYQEEKT RATQER