Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS1106A_A2022 |
Symbol | |
ID | 4904348 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009078 |
Strand | - |
Start bp | 1988222 |
End bp | 1990093 |
Gene Length | 1872 bp |
Protein Length | 623 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 640145127 |
Product | ImpA-related N-terminal family protein |
Protein accession | YP_001076055 |
Protein GI | 126457583 |
COG category | [S] Function unknown |
COG ID | [COG3515] Uncharacterized protein conserved in bacteria |
TIGRFAM ID | [TIGR03362] type VI secretion-associated protein, VC_A0119 family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.975565 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGGGATGA GCGAACGGCG CCCGCCCGGC GGCGCGGCGG CGCGCGCGCG CATGCCGATC GATGTCGAAG CGCTCGCCGT GCTGGGGCGC ACGGACATCG ATTCCGCCAT GCCCGCGGGC GCCGACGTGC GCGCCGACGC GAGGTTCGAC GCGCTGCACG CGGAGCTTGC GAAGCTCGCG TCGCCGGGCG CGAGCGGGCA AGTCGATTGG CGCGCGGCGA CGCATCTCGC CGCCGAATTG CTGCGCGAGC GCGGCAAGGA TTTGCTCGTC GGCTGCTATC TGGCGGGTGC GTTGCTGCAG ACGGGCGGCG CGGCGGGGCT GCGCTGCGGA CTCGAAATCG TCGGCGATCT CGTCGAACGT CATTGGGATG CGATGTCGCC GCCCGTGTCG CGGATGCGGG CGAGGCGCGG GGCGCTGCAA TGGCTGGTCG ATCGCGTCGA CGCCATGCAC GATGCAGGAG CCGCCGCATG CGGCGGCGCG TGCTCGGCCG AACTGGTCGC GCAATTGCGC GCGGCCGCGC GGCGCATCGA TGCGCTGCTC GCCGAGCGCG ACGACGACGC GCCGACGATG CGCGCGGTGC ATGCGTTCGC GGAGCGATTG CCGGTTGAGG TGGTTGAGGT GGTGGAAGTG GCTGACGAGG CTGATGTGGC TGAGGCGACT GAGGCGACTG AGGCGACTGA GGCGACTGAG GCGACTGAGG CGGCTGATGT GGCTGAGACG GCTGAGACGG CCGAGGCCGA TGCGCATGGC TCGACGGGAG GGCCGGCCGC GGAAATCGCG ATTGCCGCTG CCGAACAGGC TTTGATTGAT CCGGCCGGTC GAGCCGCGCC GAGCGCAGGC ACGGATACGA ACGCGAACGC AGACGCCGCC AGGCAACCGG CGCGGCTCGA CGAAGCGGCC GGCCGCGAAC GCGCGCTCGC CGATGCGCTC GCGCAACTGC ATTGCGTCGC GACGGCGTTC GCGCAAGCGG ACTGGGCCGA CGCGCGCGGC TTCCGGCTGC GCCGCGTCGC GTGCTGGTCG AGCGTGTGCG CGCTGCCGGA AACGGACGCG GAGAACGGAA GAACGCGGAT CGCCGCGCCG AGCGCTTCGA TCGTCGGCGC GGCGAAGAAC ATCGACGGGG ATGGCGAGCC TGTGGCGGCG GTGCGCTTCG CCGAAGCGCA TGCGCAGGCG TTCCCGCTCT GGCTGGATTT GCAGCGCATC GCCGCGCGCG CGCTCGCGCG CGCGGGGGGC GACGGCGCCG ATGCGCGGCG CGAAGTGGAG ACGGCGGTTC GTGCGCTGCT TGCGCGGCTG CCGGGCCTCG ACGCGCTGAC GTTCGCGGAC GGCACGCCGT TCGCCGACGA CGCGACGCGC GCATGGCTCG GCGAGCTTGG CGCGCCTGTT GTGGCGGCGG ATGCGGTGTC GCCGTCGTCT TTGCCGCTTT CGCCGCGACC TTCGCCGCCT GAGCGATCGT CGCCGATGGC GGGCGAACCG GCGCGCGCGC CGGGCGATGC GTGCGGGGCG AGCGCCGACG ATGCAGTGGA CCGAGCGTGC GCGTTTGCCG CGAGCGGCCA GCTCGATCTC GCGCTCCACG CGATTCAGCA TGCGATCGAT CGTGCGACGA GCGCCGAACA GCGGTTGAGA GCGCGCGTGC GGTTGTGCGA GCTTGCGCGC GACCATTGGC CGCATGAGGT TCCTGAGGCG TTCGCGCGCG GCGTGATCGA ACCGATTCGG CGGCACGATT TGCTCGCATG GAATCCGGAG CTGGCGCTCG ACGGCTTGTC GGCCGCCTAT GCGCTGCTGA TTCGGCGCGA TCGCGAATCG GCGCACGCGA GGACGGTGCT TGACGAGATC GCGAGCGTCG ACGCGGCGCG GGCCATGCGT TTGTCGACGT GA
|
Protein sequence | MGMSERRPPG GAAARARMPI DVEALAVLGR TDIDSAMPAG ADVRADARFD ALHAELAKLA SPGASGQVDW RAATHLAAEL LRERGKDLLV GCYLAGALLQ TGGAAGLRCG LEIVGDLVER HWDAMSPPVS RMRARRGALQ WLVDRVDAMH DAGAAACGGA CSAELVAQLR AAARRIDALL AERDDDAPTM RAVHAFAERL PVEVVEVVEV ADEADVAEAT EATEATEATE ATEAADVAET AETAEADAHG STGGPAAEIA IAAAEQALID PAGRAAPSAG TDTNANADAA RQPARLDEAA GRERALADAL AQLHCVATAF AQADWADARG FRLRRVACWS SVCALPETDA ENGRTRIAAP SASIVGAAKN IDGDGEPVAA VRFAEAHAQA FPLWLDLQRI AARALARAGG DGADARREVE TAVRALLARL PGLDALTFAD GTPFADDATR AWLGELGAPV VAADAVSPSS LPLSPRPSPP ERSSPMAGEP ARAPGDACGA SADDAVDRAC AFAASGQLDL ALHAIQHAID RATSAEQRLR ARVRLCELAR DHWPHEVPEA FARGVIEPIR RHDLLAWNPE LALDGLSAAY ALLIRRDRES AHARTVLDEI ASVDAARAMR LST
|
| |