Gene BURPS668_A2971 details


Gene Information

Locus tag: BURPS668_A2971
Symbol:
ID: 4887429
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009075
Strand:
Start bp: 2826955
End bp: 2828388
Gene Length: 1434 bp
Protein Length: 477 aa
Translation table: 11
GC content: 71%
IMG OID: 640132907
Product: FHA domain-containing protein
Protein accession: YP_001063962
Protein GI: 126445261
COG category: [T] Signal transduction mechanisms
COG ID: [COG3456] Uncharacterized conserved protein, contains FHA domain
TIGRFAM ID: [TIGR03354] type VI secretion system FHA domain protein
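
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. Both are assumptions rather than statements from the record, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2826955
END_BP = 2828388
GENE_LENGTH_BP = 1434
PROTEIN_LENGTH_AA = 477


def span_length(start: int, end: int) -> int:
    """Length of an interval, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the listed values: the interval spans 1434 positions, and 1434 / 3 = 478 codons, one of which is the stop codon.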


Plasmid Coverage information

Num covering plasmid clones: 33
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGCAACTGA CCGTTATCGA ACACGCGGGC GAGCCGGTCG GCACCGACGG CCGCAACGCC 
GTCGTGTTTC ATGCGCCGGG CGGCACGATC GGCCGGGACA GCGACAATCA CCTCGTGCTG
CGCGACGACA CCCGGCAGAT CTCGCGCCTG CAGGCGCTGC TGCAGGTGGC CGACGACGCG
TGCCTGCTGA AGAACCTGAG CAGCGTATCG ACGATCGAAG TGAACCGCGT GCCGATCGGC
TACGCGCAGG CGCAGCGCCT GAACATGGGC GACATCATCC GAATCGGCCC TTACCTGCTG
CGCGCGGAGC CCGACGACGC GACGATCGAG CGAACCGTCG AAGCCGCCAC CACGGCGGCC
GCGGCGGCGC CGGCGGCGTC CGCCGCGCAG GCTCAGGCGA AGGGGGCGGG CTACAAACTG
TGGGGCCTGC TGCACGAGCG CTTCGGGCTC GGCAAGGCAC AGGGCGCGGG CGAGCAGTCC
GGCGCGCGCG CCGCGCCGTC GCGCCACCAC GATTCGCCGG CGTCCGCTGC GCCGCGCGAC
CTGAATCAGC TGTCGACCGA TCCGCTCGAC CTGTTCGCGC AGCCGCGCGG CGATCCGGAT
GCGCGAGCCG GCGCTGCGCG CGAAGGCGAA GGCCGCGCGC CGCCCACCGT CACGCAACCG
GATCACGCGC CCGAGTGGAC GCAACACGTC CGCGTGCAGC CGGCGCAATC CGCGCCGCCC
GCCGCCTCTC GCCCCGGCGC GCCCGCCGCG CGTTCGGGCG ATATCCCCGC AGCAGGCGAT
GCGAGCGACA TGCCGTCGCG CGTGCGCGCG TCGCCGGCCC CCGCGCCGGC GACACCCGAG
ACATTGCTGC AGGCGTTCTT CGAAGGCGCG GGGCTCGACA CCGCCGCCGA GCAGCATCAC
TGGTCCGCCG AGCAGTTGTT CGTCGCGGGG CAGCTGCTCG CGCTGTTCGC CAACGGCACG
GTCGAGCTGC TGTCGTCACG CAGCATCCTG AAGCGCGAAG TGAAGGCCGA CATGACGATG
CTGCTCGACC GCGAGAACAA TCCGCTGAAG CTGCTGCCGG ACGGCAGCGC GGTGCTGCGC
CAGATGTTCG GGCTGCCGCT GCCGGGCTTC ATGACGCCGC AAAGCGCCGT GTCCGACGCG
TTCCAGGATC TGCACGCGCA CCAGATCGGC ATGGTGGCCG GCATGCGCGC CGCGCTGATG
GATCTGCTCA CGCGCTTCTC GCCGCAGCGC CTGCGCGAGC GCGACGCCGC GCCCCACTGG
TACGAGAAGC GCGTGCCGGC GCTGTACAAG GCGCGCCTCT GGGACCGCTA TGCAACCACG
CATCGCGACA CGCTGTTCGC GATCGAGGAC GATTTCGCCT CCGTGTTCGG CAAGGCGTTC
CTCAGCGCCT ACGACGCGGA AGTCGAGAGC TATCGCGGAC GCTGCCGCCG GTGA
 
Protein sequence
MQLTVIEHAG EPVGTDGRNA VVFHAPGGTI GRDSDNHLVL RDDTRQISRL QALLQVADDA 
CLLKNLSSVS TIEVNRVPIG YAQAQRLNMG DIIRIGPYLL RAEPDDATIE RTVEAATTAA
AAAPAASAAQ AQAKGAGYKL WGLLHERFGL GKAQGAGEQS GARAAPSRHH DSPASAAPRD
LNQLSTDPLD LFAQPRGDPD ARAGAAREGE GRAPPTVTQP DHAPEWTQHV RVQPAQSAPP
AASRPGAPAA RSGDIPAAGD ASDMPSRVRA SPAPAPATPE TLLQAFFEGA GLDTAAEQHH
WSAEQLFVAG QLLALFANGT VELLSSRSIL KREVKADMTM LLDRENNPLK LLPDGSAVLR
QMFGLPLPGF MTPQSAVSDA FQDLHAHQIG MVAGMRAALM DLLTRFSPQR LRERDAAPHW
YEKRVPALYK ARLWDRYATT HRDTLFAIED DFASVFGKAF LSAYDAEVES YRGRCRR
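
The relationship between the gene sequence and the protein sequence can be illustrated by translating a fragment by hand. The sketch below is a minimal translator, not the tool that produced this record. It hard-codes the codon-to-amino-acid assignments of translation table 11, which are the same as those of the standard table (the two differ only in which start codons they permit). The helper names `clean` and `translate` are hypothetical. The two fragments are the first and last lines of the gene sequence above.

```python
from itertools import product

# Codon assignments in TCAG order. Translation table 11 uses the same
# assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a DNA string in frame 0; stop codons are shown as '*'."""
    dna = clean(seq)
    usable = len(dna) - len(dna) % 3  # ignore any trailing partial codon
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First and last lines of the gene sequence listed in this record.
FIRST_LINE = "ATGCAACTGA CCGTTATCGA ACACGCGGGC GAGCCGGTCG GCACCGACGG CCGCAACGCC"
LAST_LINE = "CTCAGCGCCT ACGACGCGGA AGTCGAGAGC TATCGCGGAC GCTGCCGCCG GTGA"

if __name__ == "__main__":
    print(translate(FIRST_LINE))  # MQLTVIEHAGEPVGTDGRNA
    print(translate(LAST_LINE))   # LSAYDAEVESYRGRCRR*
```

The first fragment reproduces the opening 20 residues of the protein sequence. The last fragment reproduces its final 17 residues followed by the TGA stop codon. The last line can be translated on its own because every preceding line holds 60 bases, a multiple of three, so the reading frame is preserved.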