Gene BURPS668_A3126 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A3126 
Symbol 
ID4888889 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2965529 
End bp2966665 
Gene Length1137 bp 
Protein Length378 aa 
Translation table11 
GC content71% 
IMG OID640133062 
ProductmipA family protein 
Protein accessionYP_001064117 
Protein GI126444960 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3713] Outer membrane protein V 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value0.0524278 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGATA CGCGACACGC GCCGCGCGCG CGCCTCGCGT CGATGGCGGC GCGTGCGCTC 
GCGTGCGCGC GCCGCTGCCG CCGCTCGAGC GAGACAGGCG CGGATGCGCC GAGCGATGCG
CGCCGGCCAT TGCCCGCCGG GGCGGCGGTG CGGCTTGCCG CGCCGTGTGC CGGCCGCGCG
CGACTTTCGC GCGAAAACGC GCAGTACCGA CTACAACGCG AACAAAGAGG ACAGCACATG
AGCGACGCAC GAGCCGTTTC GATCCGGGGA ACCGCCGCTG TGCGCGACGC GGCGGCGCGC
AGGCGCACCG CGCGCGGCAT CGTCTGCGCG GCGTGCGCGG CGGCGGCCGT ATCCGCCCAC
GCGCAGACAC CATCGCCGCT CGGCGAGTGG CAGTATTCGG CCGGCGTGCC GCTCGACAAG
CTCTTCAATC CGAACCCGCA GACATGGCAG ATCTCGGTCG GCGCGGCCGC GACGCTGCAG
CCGCGCTACG ACGGCTCGAA CCAGTACCGG CCGATGGCCG GGCCGACCTT CGACGCCCGC
TATCGCGACC TGTGGTTCGT GTCGACGGGC GAGGGAATCG GCGTCAACGT GCTGCGCGGG
CCGAACTGGC GCGCGACGCT GTCGGCGGGC TATGACCTCG GCCGCCGCGA GGCCGACGAC
CGCGGCCATC TGACGGGCAT CGGCAACATC AATCCGGCGG CGGTGATCAA GCTGTCGGCC
GATTACGTGA TCTCACACGC GTTCCCGCTT GTGCTGCGCG CGGACGTGCG GCGCAGCGTC
GGCGGCGCGA ACGGCTGGGT GGCCGATCTC GCCGCCTACA TGCCGCTGCC CGGCAGCTCG
GAGACGTTCT ACTGGTTCGC GGGGCCGACC GTCACGTTCG CCGATTCGCG CTACATGAAC
AGCTGGTTCG GCGTGAACGA CGCGCAGGCC GCGCGTTCCG GACATCCGCG TTACGCGTCG
AGCGCGGGCG TGAAATCGTT CGGCGGCGGC ATGACGCTCG TGTGGTTCGT CACGAAGCAC
TGGTTCGTGA CGGCCGACGG CGCGATCGAG CAGCTCGTCG GCAGCGCCGC GCGCAGTCCG
CTCACCCAGC GCTCGACGAA CGCGGTCGTC GACGTGTCGA TCAATTACCA GTTCTAG
 
Protein sequence
MGDTRHAPRA RLASMAARAL ACARRCRRSS ETGADAPSDA RRPLPAGAAV RLAAPCAGRA 
RLSRENAQYR LQREQRGQHM SDARAVSIRG TAAVRDAAAR RRTARGIVCA ACAAAAVSAH
AQTPSPLGEW QYSAGVPLDK LFNPNPQTWQ ISVGAAATLQ PRYDGSNQYR PMAGPTFDAR
YRDLWFVSTG EGIGVNVLRG PNWRATLSAG YDLGRREADD RGHLTGIGNI NPAAVIKLSA
DYVISHAFPL VLRADVRRSV GGANGWVADL AAYMPLPGSS ETFYWFAGPT VTFADSRYMN
SWFGVNDAQA ARSGHPRYAS SAGVKSFGGG MTLVWFVTKH WFVTADGAIE QLVGSAARSP
LTQRSTNAVV DVSINYQF