Gene Information |
Locus tag | BURPS1106A_2022 |
Symbol | |
ID | 4900153 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 1106a |
Kingdom | Bacteria |
Replicon accession | NC_009076 |
Strand | + |
Start bp | 1985549 |
End bp | 1988251 |
Gene Length | 2703 bp |
Protein Length | 900 aa |
Translation table | 11 |
GC content | 73% |
IMG OID | 640135252 |
Product | argininosuccinate lyase |
Protein accession | YP_001066287 |
Protein GI | 126454180 |
COG category | [E] Amino acid transport and metabolism; [I] Lipid transport and metabolism |
COG ID | [COG0165] Argininosuccinate lyase; [COG0439] Biotin carboxylase |
TIGRFAM ID | [TIGR00838] argininosuccinate lyase |
|
|
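The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the terminal stop codon is counted in the gene length but not in the protein length. The variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields above.
start_bp = 1985549
end_bp = 1988251
reported_gene_length = 2703     # bp
reported_protein_length = 900   # aa

# Assumption: coordinates are 1-based and inclusive, so length = end - start + 1.
gene_length = end_bp - start_bp + 1

# Assumption: the gene length includes the stop codon, which encodes no residue.
codon_count, remainder = divmod(gene_length, 3)
protein_length = codon_count - 1

print(gene_length, remainder, protein_length)
assert gene_length == reported_gene_length
assert remainder == 0
assert protein_length == reported_protein_length
```

Under these assumptions the reported gene length (2703 bp) and protein length (900 aa) agree with the coordinates.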
Plasmid Coverage information |
Num covering plasmid clones | 0 |
Plasmid unclonability p-value | 0.10658 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | TTGGCGGCCA CATTCATCGT GAAGACCTTC GTATTCATCG AAAGCAACAC CACCGGCACC GGCCGGCTCT GTCTGCAAAA AGCGCTGCTG CGCGGCTTCG ACGTGCTGTT CGTCACGAGC CGGCCGCAGC TCTATCCGTT CCTGCAGGAA GAGATGGTCG TGCCGCTCGT CGCCGACACG GCCGATCCGC AGCGGATCGC CGATGCACTT GCGCCGTATG CGGGCATCGC CGGGATCTTC TCGACGTCCG AGTACTACAT CGAAACCGCC GCGACGGTGG CCACGCGCCT GGGCTTGCCC GCGGCGGATC CGGAGGCGAT CCGCACCTGC CGCGACAAGG GCCGGCTGCA CCGCCGCCTG CGCGACGCGG GCGTCGGCGT GGCCGACACC GAGATCGTGT CCGAGCGCAC GCAACTGCGC GACCTGGCGC ACGGCGCCAC GTATCCGCGC GTGCTGAAGC CGGCGTTCGG CTCCGGCAGC GTCGGCGTGC GGCTCGTGCG GACGCCGGCC GAAATGCTCG CGCACGGCGA GCGCATGCTC GACGCGCGCG GCAACGAGCG CGGCATCGCG CTCGCGCGGC AGGTGCTCGT GCAATCGTTC GTCGACGGGC CGGAATTTTC GGTCGAAGTC GTCGGGCTCG GCGCGGAGCA CGGCCATGCG GTGCTCGGCG TGACGGGCAA GCACCTCGGG CCGCTGCCGC ACTTCGTCGA AGCCGGCCAC GATTTTCCGG CGCCGATCGC GGCCGCGCAG CGCGATGCGA TCGTGGCCGA GACGCTGCGC GCGCTCGACG CGGTGGGCCA CCGCTTCGGG CCCGCCCATG TCGAATGCCG CGTGAGCGGC GGCAAGGTCG TCGTGATCGA GATCAATCCG CGTCTCGCGG GCGGCATGAT CCCGCAGGCG ATCGAATGGG CGACGGGCGT CGACGTGCTC GGCGCGATGA TCGACCTGCA CGCGGGCACG CCGCCTGACC TGGGCCCGCG CCGCCGCGGC CACGCGGCGA TCCGCTTCGT GCTGCCCGCG CGCAGCGGCG AGCTGAGGGC GCTGTCGTTC GAGCCCGACG AGCGCTTTGC GGGGGTGCGC ACGCGCTTCA TGCCGCTCAA GCAGCTTGGC CAGCGCATCG AGCCGGCCGG CGACTTCCGC GACCGTCTCG CGCTCGTCAT CGCGTCCGCG GCCGATCCGG ACGCGCTCGC GCACGCGCTC GAGGACGTCG ATCGCTGCGT GACGGTCGCG ATCGGCGACG CCGGCGCGGC GGGCGAGGGC GCAGGCGCCG GCCGGCTGCG CCGCACGCTG CATCCGGAGG CGCTCGCGAT CGTGCGCAAG CCGGCGCCGC GCGCCGAGCG GCTCGCCGAA CTCGACGCGT TCGCGGCGAT CGACGAGGCG CACCTGCTGA TGCTCGTCGA CGCGGGAATC TGCGACCGGA CGCGGGCCGC GACGGTGCTC GCGGAACTCG CGCGGCAGCG CGACGCGAAA TTCGCCGCGA TCGCCGACGC GATCGCGCCG CGCGGCACCT ACGCACTGTA CGAGCAACTG CTCATCGAGC GGGTCGGGAT CGACGCGGGG GGCGCGGTGC ATACGGCGCG CTCGCGCAAC GATATCAACG CGTGCGTCGC GAAGCTGCGC GCACGCGAGT GGTTCGACAC GTGCGGCGGC AAGCTGTGGC GCGTGCGCGC GGCGATCGTC GACAAGGCGC AGCACACGCT CGACTGGCCG TTGCCCACGT ACAGCCAGTA CCAGGCGGCG CAGCCCGGCA GCTTCGGCTA TTACCTGTGG TCGGTCGAGA CCGCGCTGCG GCGCGACCAG 
GCCGCGCTCG AACGGCTCGA CGAGGAGCTC GCCGTCTGTC CGCTCGGCGC GGGCGCGGGC GCGGGCACCG ATTTCCCGAT CCGCCCGGGC GTGAGCGCGG CGCTGCTCGG CTTCGCGCGC AGCTTCGACA GCGCGCTCGA CGCGGTCGCG AGCCGCGATC TCGTGCTGCA TTTCCTGGCC GCGATCGCGA TCGCATCGAC GACGCTCAGC CGGCTCGCGC ACGACCTGCA GCTCTGGACG ATGCGCGAGA CCGACTTCCT CGCGCTGCCG GACGAACTGA GCGGCGGCTC GTCGCTGATG CCGCAGAAGA AGAACCCATA CCTGCTGGAG ATCGTCAAAG GCAAGCTCGC GCACGTCGCG GGCGCGCTGA ACGCGGCGGT GTTCGCGTCG CAGCGCACGC CGTTCAGCAA TTCGGTCGAG ATCGGCACCG AGATGCTCGC GCCGTGCGCG GACGCCGTGC AGGCGTTCGG CGAAAGCTGC GATCTGCTGC GGCTGATGGT GAGCGGCGTG ACGGGCGATC CGGCGAAGAT GCGCGCGGCG GCCGAGGCGG GGCTCGTGAG CGCGACGCAG GTCGCCAACG CGCTGGTGCG GGAGACGGAC ATCAGCTTTC ACGCCGCGCA TCGGCAGATC GGCGCGCTGA TCACGCAGGC GCTCGACGCG CACGAGGACC CGGCCGCGGC GCTCGACGCG CTCGTGCGGC AGCCGGGCGC ATCGATCGAC GAAGCGGCCG CGCGGCTCGC CTACGGCGGC GGGCCCGGCG CGGCGGGCGC GGGGCTCGCG CGCTCGCGCG CGCTGCTGCG GCAGTCGGCC GAACGCCTGT GGCGGCGCCG CGCCGCGTGG CACGCGGCGC ACGCGCGGCG GCGCGGGTGC GTCGCCGATC TGCTCGCGGC GGCGGCGGCC TGA
|
Protein sequence | MAATFIVKTF VFIESNTTGT GRLCLQKALL RGFDVLFVTS RPQLYPFLQE EMVVPLVADT ADPQRIADAL APYAGIAGIF STSEYYIETA ATVATRLGLP AADPEAIRTC RDKGRLHRRL RDAGVGVADT EIVSERTQLR DLAHGATYPR VLKPAFGSGS VGVRLVRTPA EMLAHGERML DARGNERGIA LARQVLVQSF VDGPEFSVEV VGLGAEHGHA VLGVTGKHLG PLPHFVEAGH DFPAPIAAAQ RDAIVAETLR ALDAVGHRFG PAHVECRVSG GKVVVIEINP RLAGGMIPQA IEWATGVDVL GAMIDLHAGT PPDLGPRRRG HAAIRFVLPA RSGELRALSF EPDERFAGVR TRFMPLKQLG QRIEPAGDFR DRLALVIASA ADPDALAHAL EDVDRCVTVA IGDAGAAGEG AGAGRLRRTL HPEALAIVRK PAPRAERLAE LDAFAAIDEA HLLMLVDAGI CDRTRAATVL AELARQRDAK FAAIADAIAP RGTYALYEQL LIERVGIDAG GAVHTARSRN DINACVAKLR AREWFDTCGG KLWRVRAAIV DKAQHTLDWP LPTYSQYQAA QPGSFGYYLW SVETALRRDQ AALERLDEEL AVCPLGAGAG AGTDFPIRPG VSAALLGFAR SFDSALDAVA SRDLVLHFLA AIAIASTTLS RLAHDLQLWT MRETDFLALP DELSGGSSLM PQKKNPYLLE IVKGKLAHVA GALNAAVFAS QRTPFSNSVE IGTEMLAPCA DAVQAFGESC DLLRLMVSGV TGDPAKMRAA AEAGLVSATQ VANALVRETD ISFHAAHRQI GALITQALDA HEDPAAALDA LVRQPGASID EAAARLAYGG GPGAAGAGLA RSRALLRQSA ERLWRRRAAW HAAHARRRGC VADLLAAAAA
|
| |
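The sequences above are displayed in space-separated blocks of ten characters, and the gene begins with TTG rather than ATG. The sketch below shows one way to normalize such a block-formatted string, compute its GC fraction, and translate the first ten codons. It assumes that an alternative start codon is read as methionine at position 1 under translation table 11. The codon table is deliberately partial (only the codons that occur in the first 30 bases), the start codon set is a subset of those allowed by table 11, and the helper names are hypothetical.

```python
# Partial codon table: only the codons present in the first 30 bases of this gene.
CODONS = {
    "TTG": "L", "GCG": "A", "GCC": "A", "ACA": "T", "TTC": "F",
    "ATC": "I", "GTG": "V", "AAG": "K", "ACC": "T",
}

# Subset of the start codons permitted by translation table 11 (assumption:
# any of these at position 1 is translated as Met).
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the display whitespace and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate_prefix(seq: str) -> str:
    """Translate complete codons using the partial table above."""
    s = clean(seq)
    codons = [s[i:i + 3] for i in range(0, len(s) - len(s) % 3, 3)]
    residues = [CODONS[c] for c in codons]
    if codons and codons[0] in START_CODONS:
        residues[0] = "M"  # initiator codon is read as Met
    return "".join(residues)


# First three display blocks of the gene sequence above.
first_30_bases = "TTGGCGGCCA CATTCATCGT GAAGACCTTC"
print(translate_prefix(first_30_bases))  # MAATFIVKTF
```

The translated prefix matches the first block of the protein sequence (MAATFIVKTF). Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the reported GC content; translating the full gene would need a complete codon table.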