Gene BURPS1106A_1601 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1601 
Symbol 
ID4900354 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1551397 
End bp1553298 
Gene Length1902 bp 
Protein Length633 aa 
Translation table11 
GC content69% 
IMG OID640134831 
Producthypothetical protein 
Protein accessionYP_001065872 
Protein GI126454611 
COG category[S] Function unknown 
COG ID[COG3519] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03359] type VI secretion protein, VC_A0110 family 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGTCGTCGG CATCGGCCGC CGGCGTGCGC AGGCACTTTC GAGAACCGCT GATGGATGAA 
CTCCTCTCAT ACTACGAACG GGAACTGGCG TTCCTGCGCC GCCACGTGCG CGATTTCGCC
GAGCGCTATC CGAAGATCGC GAGCCGCATG CAGCTCGCGA GCGGCGCGGA CGGCAGCAGC
GAGGATCCCC AGGTCGAGCG GCTGTTCGAA TCATTCGCGC TGACGGGCGC GCGCGCGTCG
CGGCACATCG ACGACGACTA CTCCGAGTTC ACGAAGGCGT TCGTCGAGGT GCTGTACCCG
CATTATCTGC GCGTGTTCCC GTCGTGCTCG ATCGCGTCGT TCGACATCGA TGCGAGCCGC
GCCGCGCAGA TGTCGGCGGC CGTGGTCGTG CCGCGCGGCA CGCAGCTGTA CAGCCGGCCC
GTGAGGGGCG CGAAGCTGTT CTTCCGCACC GCGTACGACG TGACGCTGTC GCCGCTGCAA
CTGACGGCCG CGCGCTTTCA CGCGATGCCG CAGGCGCCGC GTTCGTTCCG GCTGCCGCCG
AACGCGAGCG CGCAGCTCTC GCTGTCGTTC GCGATCCGCT CGCCGCACGC GAGCGTCGCC
GATCTGAAGC TCGATTCCGT TCGCCTGTAC ACGCGCGGCG AGCCGCTGAT GAGCGCCGCG
CTGCGCGACG CGCTGTCGAT TCATGCATTG CAGGCGTACG TCGAGCCGGA GCAGGGCGGG
CGCTGGGTGG CGCTCGAGCG CGTGCCCTTC GCGGCCGTCG GCGTGTCGCG CGAGGACAGC
CTGATTCCTT GCCCGCAGGG TGTGCATCCC GCGTATCCGC TGCTGACCGA GTATTTCGCC
TTTGCCGAGA AATTCGGCTT CTTCGATTGC GATCTGCGCG AGGCCGGGCG CCTCGGCAGG
CGGCACTTCA CGCTTCATCT GCTGCTCAAG GGCATCCCGG CCGATTCCGC GAAGGCGGGC
GTGCTCGAAT CGCTGAGCGC CGAGCACGTG CTGCTCGGCT GCACGCCGGT GATCAATCTG
TTCGAGACGA CAGGCAAGCT CGGCCAGCAG CCGAGCGCCG CGTCGGGCGC GCACATGCAG
CCGCTCGTCG TCGACAAGCA GAATGCCTAT GCATACGAAG TCTATTCGGT CGATGCGGTG
ACGCAGGTCC AGGAGACGCC GCAGGGCGAG CGCGTCACGA CGTTCCCGTC GCTGCATTCG
CTGTACCACG GCGGGCAGGC GGCGCGGGCG TCGCTGTACT GGCGCATGCG GCGCGACGCG
CTTGTCGCGA GAAGCGAGCC GGGGCACGAG CTGTCGCTCG GCTTCGTCGA CGGCGCGCTC
GATCCGGTCG CGGCGCCGGC CGGCCTCGAT TTCAAGCTCA CATGCAGCAA CCGCGATTTG
CCGGAGCATC TGCCGCACGG CGCGCCGGGC GGCGATCTGA TGATGGAAGG CGGCACGCTC
GCGAGCCGGA TCGGCCTGTT GCAGCGGCCC ACGCGGCCGC TGCGCTTGCG CGAGGATCGC
GGCGTGCTGT GGCGTCTCGT GTCGCAGCTC TCGTCGAACT CGCTGTTGCT CGCCGGCGGC
GCCGGCGCCG TGCGTGACCT GCTCGGGCTG CACGACGTGC AGGAGTCGCC CGCGACCGTG
CGTCAGATCG CGGGCATCGT CGACGTGTCG CAGAAGCCCG TGACCGCGTG GGTATCCGAG
AAGCCGTTCG CGAGCGTCGT GCGGGGGCTC GAGATTCGCA TCACGGTCGA CGAAGAGTGC
TTCGCGGGCA CGGGTGTCCA CACCTTCGCG CAACTGATGG ATTGCCTGTT GAGCCGATAC
GTCGCGCCGA ACGGCTTCAC GCAGCTCGTG CTCGTGTCGA GCCGAACCGG CGACGTGCTG
TGCACGTGCG CGCGTCGCGC GGGCGGCGGA TTCCTGATCT AG
 
Protein sequence
MSSASAAGVR RHFREPLMDE LLSYYERELA FLRRHVRDFA ERYPKIASRM QLASGADGSS 
EDPQVERLFE SFALTGARAS RHIDDDYSEF TKAFVEVLYP HYLRVFPSCS IASFDIDASR
AAQMSAAVVV PRGTQLYSRP VRGAKLFFRT AYDVTLSPLQ LTAARFHAMP QAPRSFRLPP
NASAQLSLSF AIRSPHASVA DLKLDSVRLY TRGEPLMSAA LRDALSIHAL QAYVEPEQGG
RWVALERVPF AAVGVSREDS LIPCPQGVHP AYPLLTEYFA FAEKFGFFDC DLREAGRLGR
RHFTLHLLLK GIPADSAKAG VLESLSAEHV LLGCTPVINL FETTGKLGQQ PSAASGAHMQ
PLVVDKQNAY AYEVYSVDAV TQVQETPQGE RVTTFPSLHS LYHGGQAARA SLYWRMRRDA
LVARSEPGHE LSLGFVDGAL DPVAAPAGLD FKLTCSNRDL PEHLPHGAPG GDLMMEGGTL
ASRIGLLQRP TRPLRLREDR GVLWRLVSQL SSNSLLLAGG AGAVRDLLGL HDVQESPATV
RQIAGIVDVS QKPVTAWVSE KPFASVVRGL EIRITVDEEC FAGTGVHTFA QLMDCLLSRY
VAPNGFTQLV LVSSRTGDVL CTCARRAGGG FLI