Gene BURPS668_A2120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS668_A2120 
Symbol 
ID4886471 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 668 
KingdomBacteria 
Replicon accessionNC_009075 
Strand
Start bp2053416 
End bp2055323 
Gene Length1908 bp 
Protein Length635 aa 
Translation table11 
GC content72% 
IMG OID640132057 
ProductImpA-related N-terminal family protein 
Protein accessionYP_001063114 
Protein GI126443381 
COG category[S] Function unknown 
COG ID[COG3515] Uncharacterized protein conserved in bacteria 
TIGRFAM ID[TIGR03362] type VI secretion-associated protein, VC_A0119 family 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value0.570001 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGGATGA GCGAACGGCG CCCGCCCGGC GGCGCGACGG CGCGCGCGCG CATGCCGATC 
GATGTCGAAG CGCTCGCCGT GCTGGGGCGC ACGGACATCG ATTCCGCCAT GCCCGCGGGC
GCCGACGTGC GCGCCGACGC GAGGTTCGAC GCGCTGCACG CGGAGCTTGC GAAGCTCGCG
TCGCCGGGCG CGAGCGGGCA AGTCGATTGG CGCGCGGCGA CGCATCTCGC CGCCGAATTG
CTGCGCGAGC GCGGCAAGGA TTTGCTCGTC GGCTGCTATC TGGCGGGTGC GTTGCTGCAG
ACGGGCGGCG CGGCGGGGCT GCGCTGCGGA CTCGAAATCG TCGGCGATCT CGTCGAACGT
CATTGGGATG CGATGTCGCC GCCCGTGTCG CGGATGCGGG CGAGGCGCGG GGCGCTGCAA
TGGCTGGTCG ATCGCGTCGA CGCCATGCAC GATGCAGGAG CCGCGGCATG CGGCGGCGCG
TGCTCGGCCG AACTGGTCGC GCAATTGCGC GCGGCCGCGC GGCGCATCGA TGCGCTGCTC
GCCGAGCGCG ACGACGACGC GCCGACGATG CGCGCGGTGC ATGCGTTCGC GCAGCGATTG
CCGGTTGAGG TGGTTGAGGT GGTGGAAGTG GCTGACGAGG CTGATGAGGC TGATGAGGCT
GAGACGGCTC AGACGGCTCA GACGGCTCAG ACGGCTCAGA CGGCTCAGAC GGCTCAGACG
GCCGAGACGG CTGAGACGGC CGAGACGGCC GAGACGGCCG AGACGGCCGA GGCCGATGCG
CACGGCTCGA CGGGAGGGCC GGCCGCGGAA ATCGCGATTG CCGCCGCCGA ACAGGCTTTG
ATTGATCCGG CCGGTCGAGC CGCGCCGAGC GCCGGCACGG ATACGAACGC GAACGCAGAC
GCCGCCGGGC AACCGGCGCG GCTCGACGAA GCGGCCGGCC GCGAACGCGC GCTCGCCGAT
GCGCTCGCGC AACTGCATTG CGTCGCGACG GCGTTCGCGC AAGCGGACTG GGCCGACGCG
CGCGGCTTCC GGCTGCGCCG CGTCGCGTGC TGGTCGAGCG TGTGCGCGCT GCCGGAAACG
GACACGGAGA ACGGAAGAAC GCGGATCGCC GCGCCGAGCG CTTCGATCGT CGGCGCGGCG
AAGAACATCG ACGGGGATGG CGAGCCCGTG GCGGCGGTGC GCTTCGCCGA AGCGCATGCG
CAGGCGTTCC CGCTCTGGCT GGATTTGCAG CGCATCGCCG CGCGCGCGCT CGCGCGCGCG
GGGGGCGACG GCGCCGATGC GCGGCGCGAA GTGGAGACGG CGGTTCGTGC ACTGCTTGCG
CGGCTGCCGG GCCTCGACGC GCTGACGTTC GCGGACGGCA CGCCGTTCGC CGACGACGCG
ACGCGCGCAT GGCTCGGCGA GCTTGGCGCG CCTGTTGTGG CGGCGGATGC GGTGTCGCCG
TCGTCTTTGC CGCTTTCGCC GCGACCTTCG CCGCCTGAGC GATCGTCGCC GATGGCGGGC
GAACCGGCGC GCGCGCCGGG CGATGCGTGC GGGGCGAGCG CCGACGATGC AGTGGACCGA
GCGTGCGCGT TTGCCGCGAG CGGCCAGCTC GATCTCGCGC TCCACGCGAT TCAGCATGCG
ATCGATCGTG CGACGAGCGC CGAACAGCGG TTGAGAGCGC GCGTGCGGTT GTGCGAGCTT
GCGCGCGACC ATTGGCCGCA TGAGGTTCCT GAGGCGTTCG CGCGCGGCGT GATCGAACCG
ATTCGGCGGC ACGACTTGCT CGCATGGAAT CCGGAGCTGG CGCTCGACGG CTTGTCGGCC
GCCTATGCGC TGCTGATTCG GCGCGATCGC GAATCGGCGC ACGCGAGGAC GGTGCTTGAC
GAGATCGCGA GCGTCGACGC GGCGCGGGCC ATGCGTTTGT CGACGTGA
 
Protein sequence
MGMSERRPPG GATARARMPI DVEALAVLGR TDIDSAMPAG ADVRADARFD ALHAELAKLA 
SPGASGQVDW RAATHLAAEL LRERGKDLLV GCYLAGALLQ TGGAAGLRCG LEIVGDLVER
HWDAMSPPVS RMRARRGALQ WLVDRVDAMH DAGAAACGGA CSAELVAQLR AAARRIDALL
AERDDDAPTM RAVHAFAQRL PVEVVEVVEV ADEADEADEA ETAQTAQTAQ TAQTAQTAQT
AETAETAETA ETAETAEADA HGSTGGPAAE IAIAAAEQAL IDPAGRAAPS AGTDTNANAD
AAGQPARLDE AAGRERALAD ALAQLHCVAT AFAQADWADA RGFRLRRVAC WSSVCALPET
DTENGRTRIA APSASIVGAA KNIDGDGEPV AAVRFAEAHA QAFPLWLDLQ RIAARALARA
GGDGADARRE VETAVRALLA RLPGLDALTF ADGTPFADDA TRAWLGELGA PVVAADAVSP
SSLPLSPRPS PPERSSPMAG EPARAPGDAC GASADDAVDR ACAFAASGQL DLALHAIQHA
IDRATSAEQR LRARVRLCEL ARDHWPHEVP EAFARGVIEP IRRHDLLAWN PELALDGLSA
AYALLIRRDR ESAHARTVLD EIASVDAARA MRLST