Gene BURPS668_1575 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_1575 
Symbol 
ID4884209 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009074 
Strand
Start bp1540418 
End bp1542268 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content69% 
IMG OID640127503 
Producthypothetical protein 
Protein accessionYP_001058616 
Protein GI126440529 
COG category[S] Function unknown 
COG ID[COG3519] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03359] type VI secretion protein, VC_A0110 family 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value0.3632 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATGAAC TCCTCTCATA CTACGAACGG GAACTGGCGT TCCTGCGCCG GCACGTGCGC 
GATTTCGCCG AGCGCTATCC GAAGATCGCG AGCCGCATGC AGCTCGCGAG CGGCGCGGAC
GGCAGCAGCG AGGATCCCCA GGTCGAGCGG CTGTTCGAAT CATTCGCGCT GACGGGCGCG
CGCGCGTCGC GGCACATCGA CGACGACTAC TCCGAGTTCA CGAAGGCGTT CGTCGAGGTG
CTGTACCCGC ATTATCTGCG CGTGTTCCCG TCGTGCTCGA TCGCGTCGTT CGACATCGAT
GCGAGCCGCG CCGCGCAGAT GTCGGCGGCC GTGGTCGTGC CGCGCGGCAC GCAGCTGTAC
AGCCGGCCCG TGAGGGGCGC GAAGCTGTTC TTCCGCACCG CGTACGACGT GACGCTGTCG
CCGCTGCAAC TGACGGCCGC GCGCTTTCAC GCGATGCCGC AGGCGCCGCG TTCGTTCCGG
CTGCCGCCGA ACGCGAGCGC GCAGCTCTCG CTGTCGTTCG CGATCCGCTC GCCGCACGCG
AGCGTCGCCG ATCTGAAGCT CGATTCCGTT CGCCTGTACA CGCGCGGCGA GCCGCTGATG
AGCGCCGCGC TGCGCGACGC GCTGTCGATT CATGCATTGC AGGCGTACGT CGAGCCGGAG
CAGGGCGGGC GCTGGGTGGC GCTCGAGCGC GTGCCCTTCG CGGCCGTCGG CGTGTCGCGC
GAGGACAGCC TGATTCCATG CCCGCAGGGT GTGCATCCCG CGTATCCGCT GCTGACCGAG
TATTTCGCCT TTGCCGAGAA ATTCGGCTTC TTCGATTGCG ATCTGCGCGA GGCCGGGCGC
CTCGGCAGGC GGCACTTCAC GCTTCATCTG CTGCTCAAGG GCATCCCGGC CGATTCAGCG
AAGGCGGGCG TGCTCGAATC GCTGAGCGCC GAGCACGTGC TGCTCGGCTG CACGCCGGTG
ATCAATCTGT TCGAGACGAC AGGCAAGCTC GGCCAGCAGC CGAGCGCCGC GTCGGGCGCG
CACATGCAGC CGCTCGTCGT CGACAAGCAG AATGCCTATG CATACGAAGT CTATTCGGTC
GATGCGGTGA CGCAGGTCCA GGAGACGCCG CAGGGCGAGC GCGTCACGAC GTTCCCGTCG
CTGCATTCGC TGTACCACGG CGGGCAGGCG GCGCGGGCGT CGCTGTACTG GCGCATGCGG
CGCGACGCGC TTGTCGCGAG AAGCGAGCCG GGGCACGAGC TGTCGCTCGG CTTCGTCGAC
GGCGCGCTCG ATCCGGTCGC GGCGCCGGCC GGCCTCGATT TCAAGCTCAC ATGCAGCAAC
CGCGATTTGC CGGAGCATCT GCCGCACGGC GCGCCGGGCG GCGATCTGAT GATGGAAGGC
GGCACGCTCG CGAGCCGGAT CGGCCTGTTG CAGCGGCCCA CGCGGCCGCT GCGCTTGCGC
GAGGATCGCG GCGTGCTGTG GCGTCTCGTG TCGCAGCTCT CGTCGAACTC GCTGTTGCTC
GCCGGCGGCG CCGGCGCCGT GCGTGACCTG CTCAGGCTGC ACGACGTGCA GGAGTCGCCC
GCGACCGTGC GCCAGATCGC GGGCATCGTC GACGTGTCGC AGAAGCCCGT GACCGCGTGG
GTATCCGAGA AGCCGTTCGC GAGCGTCGTG CGGGGGCTCG AGATTCGCAT CACGGTCGAC
GAAGAGTGCT TCGCGGGCAC GGGTGTCCAC ACCTTCGCGC AACTGATGGA TTGCCTGTTG
AGCCGATACG TCGCGCCGAA CGGCTTCACG CAGCTCGTGC TCGTGTCGAG CCGAACCGGC
GACGTGCTGT GCACGTGCGC GCGTCGCGCG GGCGGCGGAT TCCTGATCTA G
 
Protein sequence
MDELLSYYER ELAFLRRHVR DFAERYPKIA SRMQLASGAD GSSEDPQVER LFESFALTGA 
RASRHIDDDY SEFTKAFVEV LYPHYLRVFP SCSIASFDID ASRAAQMSAA VVVPRGTQLY
SRPVRGAKLF FRTAYDVTLS PLQLTAARFH AMPQAPRSFR LPPNASAQLS LSFAIRSPHA
SVADLKLDSV RLYTRGEPLM SAALRDALSI HALQAYVEPE QGGRWVALER VPFAAVGVSR
EDSLIPCPQG VHPAYPLLTE YFAFAEKFGF FDCDLREAGR LGRRHFTLHL LLKGIPADSA
KAGVLESLSA EHVLLGCTPV INLFETTGKL GQQPSAASGA HMQPLVVDKQ NAYAYEVYSV
DAVTQVQETP QGERVTTFPS LHSLYHGGQA ARASLYWRMR RDALVARSEP GHELSLGFVD
GALDPVAAPA GLDFKLTCSN RDLPEHLPHG APGGDLMMEG GTLASRIGLL QRPTRPLRLR
EDRGVLWRLV SQLSSNSLLL AGGAGAVRDL LRLHDVQESP ATVRQIAGIV DVSQKPVTAW
VSEKPFASVV RGLEIRITVD EECFAGTGVH TFAQLMDCLL SRYVAPNGFT QLVLVSSRTG
DVLCTCARRA GGGFLI