Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_A2120 |
Symbol | |
ID | 4886471 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009075 |
Strand | - |
Start bp | 2053416 |
End bp | 2055323 |
Gene Length | 1908 bp |
Protein Length | 635 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640132057 |
Product | ImpA-related N-terminal family protein |
Protein accession | YP_001063114 |
Protein GI | 126443381 |
COG category | [S] Function unknown |
COG ID | [COG3515] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03362] type VI secretion-associated protein, VC_A0119 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 0.570001 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGATGA GCGAACGGCG CCCGCCCGGC GGCGCGACGG CGCGCGCGCG CATGCCGATC GATGTCGAAG CGCTCGCCGT GCTGGGGCGC ACGGACATCG ATTCCGCCAT GCCCGCGGGC GCCGACGTGC GCGCCGACGC GAGGTTCGAC GCGCTGCACG CGGAGCTTGC GAAGCTCGCG TCGCCGGGCG CGAGCGGGCA AGTCGATTGG CGCGCGGCGA CGCATCTCGC CGCCGAATTG CTGCGCGAGC GCGGCAAGGA TTTGCTCGTC GGCTGCTATC TGGCGGGTGC GTTGCTGCAG ACGGGCGGCG CGGCGGGGCT GCGCTGCGGA CTCGAAATCG TCGGCGATCT CGTCGAACGT CATTGGGATG CGATGTCGCC GCCCGTGTCG CGGATGCGGG CGAGGCGCGG GGCGCTGCAA TGGCTGGTCG ATCGCGTCGA CGCCATGCAC GATGCAGGAG CCGCGGCATG CGGCGGCGCG TGCTCGGCCG AACTGGTCGC GCAATTGCGC GCGGCCGCGC GGCGCATCGA TGCGCTGCTC GCCGAGCGCG ACGACGACGC GCCGACGATG CGCGCGGTGC ATGCGTTCGC GCAGCGATTG CCGGTTGAGG TGGTTGAGGT GGTGGAAGTG GCTGACGAGG CTGATGAGGC TGATGAGGCT GAGACGGCTC AGACGGCTCA GACGGCTCAG ACGGCTCAGA CGGCTCAGAC GGCTCAGACG GCCGAGACGG CTGAGACGGC CGAGACGGCC GAGACGGCCG AGACGGCCGA GGCCGATGCG CACGGCTCGA CGGGAGGGCC GGCCGCGGAA ATCGCGATTG CCGCCGCCGA ACAGGCTTTG ATTGATCCGG CCGGTCGAGC CGCGCCGAGC GCCGGCACGG ATACGAACGC GAACGCAGAC GCCGCCGGGC AACCGGCGCG GCTCGACGAA GCGGCCGGCC GCGAACGCGC GCTCGCCGAT GCGCTCGCGC AACTGCATTG CGTCGCGACG GCGTTCGCGC AAGCGGACTG GGCCGACGCG CGCGGCTTCC GGCTGCGCCG CGTCGCGTGC TGGTCGAGCG TGTGCGCGCT GCCGGAAACG GACACGGAGA ACGGAAGAAC GCGGATCGCC GCGCCGAGCG CTTCGATCGT CGGCGCGGCG AAGAACATCG ACGGGGATGG CGAGCCCGTG GCGGCGGTGC GCTTCGCCGA AGCGCATGCG CAGGCGTTCC CGCTCTGGCT GGATTTGCAG CGCATCGCCG CGCGCGCGCT CGCGCGCGCG GGGGGCGACG GCGCCGATGC GCGGCGCGAA GTGGAGACGG CGGTTCGTGC ACTGCTTGCG CGGCTGCCGG GCCTCGACGC GCTGACGTTC GCGGACGGCA CGCCGTTCGC CGACGACGCG ACGCGCGCAT GGCTCGGCGA GCTTGGCGCG CCTGTTGTGG CGGCGGATGC GGTGTCGCCG TCGTCTTTGC CGCTTTCGCC GCGACCTTCG CCGCCTGAGC GATCGTCGCC GATGGCGGGC GAACCGGCGC GCGCGCCGGG CGATGCGTGC GGGGCGAGCG CCGACGATGC AGTGGACCGA GCGTGCGCGT TTGCCGCGAG CGGCCAGCTC GATCTCGCGC TCCACGCGAT TCAGCATGCG ATCGATCGTG CGACGAGCGC CGAACAGCGG TTGAGAGCGC GCGTGCGGTT GTGCGAGCTT GCGCGCGACC ATTGGCCGCA TGAGGTTCCT GAGGCGTTCG CGCGCGGCGT GATCGAACCG ATTCGGCGGC ACGACTTGCT CGCATGGAAT CCGGAGCTGG CGCTCGACGG CTTGTCGGCC GCCTATGCGC TGCTGATTCG GCGCGATCGC GAATCGGCGC ACGCGAGGAC GGTGCTTGAC GAGATCGCGA GCGTCGACGC GGCGCGGGCC ATGCGTTTGT CGACGTGA
|
Protein sequence | MGMSERRPPG GATARARMPI DVEALAVLGR TDIDSAMPAG ADVRADARFD ALHAELAKLA SPGASGQVDW RAATHLAAEL LRERGKDLLV GCYLAGALLQ TGGAAGLRCG LEIVGDLVER HWDAMSPPVS RMRARRGALQ WLVDRVDAMH DAGAAACGGA CSAELVAQLR AAARRIDALL AERDDDAPTM RAVHAFAQRL PVEVVEVVEV ADEADEADEA ETAQTAQTAQ TAQTAQTAQT AETAETAETA ETAETAEADA HGSTGGPAAE IAIAAAEQAL IDPAGRAAPS AGTDTNANAD AAGQPARLDE AAGRERALAD ALAQLHCVAT AFAQADWADA RGFRLRRVAC WSSVCALPET DTENGRTRIA APSASIVGAA KNIDGDGEPV AAVRFAEAHA QAFPLWLDLQ RIAARALARA GGDGADARRE VETAVRALLA RLPGLDALTF ADGTPFADDA TRAWLGELGA PVVAADAVSP SSLPLSPRPS PPERSSPMAG EPARAPGDAC GASADDAVDR ACAFAASGQL DLALHAIQHA IDRATSAEQR LRARVRLCEL ARDHWPHEVP EAFARGVIEP IRRHDLLAWN PELALDGLSA AYALLIRRDR ESAHARTVLD EIASVDAARA MRLST
|
| |