Gene BURPS1106A_A2831 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_A2831 
Symbol 
ID4906355 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009078 
Strand
Start bp2774607 
End bp2776478 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content71% 
IMG OID640145934 
Producthypothetical protein 
Protein accessionYP_001076860 
Protein GI126456530 
COG category[S] Function unknown 
COG ID[COG3519] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03359] type VI secretion protein, VC_A0110 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.843813 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATACGC GCCTGCTCGA CTACTACAAC CGCGAGCTCG CGTATCTGCG CGAGTTGGGC 
GGCGAGTTCG CGCAGCAGTT TCCGAAAGTG GCCGCGCGCC TGCGGATGCA CGAATCGGGG
CCGCCCGATC CGTACGTCGA GCGGCTGCTC GAAGGCTTCA GCTTTCTCAC CGCGCGCGTG
CAACTGAAGA TGGACGCGGA GTTTCCGCGC TTCACGCAGG CGCTGCTCGA CGCGGTGTAT
CCGGGTTACG TCGCGCCGCT TCCGTCGATG GCGATCGTGC AGTTCACGCC GATGATGAAC
GAAGGCAGCC TCGCGCAGGG CTACCGGCTG CCGGCGGGCA CCGCGCTGCG CGCGCGGCCC
GCCGCGGCCG AACAGACCGC GTGCGAGTTT CGCACCGCGC ACGATCTGAC GCTGTGGCCG
CTGGAGCTCG CGGGCGCTTC GGTGACGGGC GCGCCCGCGT ATCTGCCGCG TTCGGCGACG
GCCGCGCGCC GCGACGTGCG CGGCGCGCTG CGCATCCGGC TGAAGGCGCG CGGCGGCGCG
GGCCTCGCGC AACTGCCGAT CGATCGGCTG ATGTTCCACC TGGCGGGCCC CGAGCGCGAC
GCGCTGCATC TGCTCGAACT GATCGCCGGG CATACGATCG GCGTCGTCTG CCACGACGCG
GCGCAGCCGC CGCGCTGGCT GCACGCGCTT GGCGCGCACG CGCTCGCGCA TCAGGGCTTC
GACGCCGATC AGGCGCTGCT GCCCGACGAA GGCCGCAGCT TCCACGGTTA CCGGCTGCTG
CGCGAGTACT TCGCGTTTCC CGCGCGCTTC CTGTTCTTCA GCATCGAAGG ATTGCGGCCC
GCGCTCGCGC GCGCGACGGG CGACACGTTC GAGCTGACGC TGCTGCTCGA TCGGCACGAC
GCGGCGCTCG AGAACAGCGT CGATGCGCGG CACCTCGCGT TGAACTGCAC GCCGGCCGTC
AACCTGTTCG CGCGGCGCGC GGACCGCATT CCGGTCCATC CGGGCGCGCG CGAGCATCAT
GTCGTCGTCG ATCGCAGCCG GCCGCTCGAC TACGAGGTCT ACGCGGTGCG GCGGCTCGCG
GGCGAGCAGC GCGACGACGG GCAGATGCGC GCGTTCCGGC CGTTCCATGC GTCGTTCGCG
GGCGACGGCG GCAATTACGG CGCGTACTAC ACGGTGCGCC GCGAGCCGCG CCTCGTGTCC
GCGCAGGCGC GCGCGAACGG CACGCGCACC GGCTACGTCG GCAGCGAGAC GTTCGTGTCG
CTCGTCGATA GCGCGTGCGC GCCGTATGAC GAATCGATCC GCTATCTGTC CGTCGACACG
CTGTGCACGA ACCGCGATCT CGTCCTGCTG TTGCCGGCGG GCGACGCGAA CGCGTTCACG
CTGCGCGTGT CGGCGCCCGT CGAGCGGATC GCCATGATCC GCGGGCCGTC GCGGCCGCGC
CCGCCGCTCG CCGACGCGCA GAGCGCGTGG CGGCTCGTGA GCCATCTCGG GCTCGCGCGC
CACACGCTGA CCGATGTCGA CGACGAAGAA GGCGCGCGCG TGCTGCGCGA ATTGCTCGGC
CTGCACGCGG ACCCGGCCGA TGCGGCGATG CGCCGGCAGA TCGACGGCGT GCATCGTGTC
GCGTTCGCGC CGGTGTTTCG CCGGCTGCCC GCCGCCGGGC CGCTGATGTT CGGGCGCGGC
GTGCAGGTGG ACGTGACCGT CGACGATCAT GCGTTCTCCG GCGACAGCCC CTATTTGCTC
GGCGCGGTGC TCGAGCAGTT TTTCGCGCGG CACGTGTCGA TCAACTCGTT CGCCGAATGC
GTGCTGAGCA GCGCGCAGCG CGGCAGGCTC GCGCAATGGC CGGCGCGCGT CGGCAGGCGG
CCCGCGATAT GA
 
Protein sequence
MDTRLLDYYN RELAYLRELG GEFAQQFPKV AARLRMHESG PPDPYVERLL EGFSFLTARV 
QLKMDAEFPR FTQALLDAVY PGYVAPLPSM AIVQFTPMMN EGSLAQGYRL PAGTALRARP
AAAEQTACEF RTAHDLTLWP LELAGASVTG APAYLPRSAT AARRDVRGAL RIRLKARGGA
GLAQLPIDRL MFHLAGPERD ALHLLELIAG HTIGVVCHDA AQPPRWLHAL GAHALAHQGF
DADQALLPDE GRSFHGYRLL REYFAFPARF LFFSIEGLRP ALARATGDTF ELTLLLDRHD
AALENSVDAR HLALNCTPAV NLFARRADRI PVHPGAREHH VVVDRSRPLD YEVYAVRRLA
GEQRDDGQMR AFRPFHASFA GDGGNYGAYY TVRREPRLVS AQARANGTRT GYVGSETFVS
LVDSACAPYD ESIRYLSVDT LCTNRDLVLL LPAGDANAFT LRVSAPVERI AMIRGPSRPR
PPLADAQSAW RLVSHLGLAR HTLTDVDDEE GARVLRELLG LHADPADAAM RRQIDGVHRV
AFAPVFRRLP AAGPLMFGRG VQVDVTVDDH AFSGDSPYLL GAVLEQFFAR HVSINSFAEC
VLSSAQRGRL AQWPARVGRR PAI