Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | BURPS668_2507 |
Symbol | |
ID | 4884988 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Burkholderia pseudomallei 668 |
Kingdom | Bacteria |
Replicon accession | NC_009074 |
Strand | - |
Start bp | 2467656 |
End bp | 2470394 |
Gene Length | 2739 bp |
Protein Length | 912 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 640128435 |
Product | argininosuccinate lyase |
Protein accession | YP_001059534 |
Protein GI | 126438665 |
COG category | [E] Amino acid transport and metabolism [I] Lipid transport and metabolism |
COG ID | [COG0165] Argininosuccinate lyase [COG0439] Biotin carboxylase |
TIGRFAM ID | [TIGR00838] argininosuccinate lyase [TIGR02019] bacteriochlorophyll 4-vinyl reductase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGGGCA TCTTTGTCTT CATCGAGAGC AACACGACGG GCACGGGCGA GCTCCTCGTG CGCAAGGCGC TGCAACGCGG CCTCACGCCT TACTTTCTCA CCGCGAACCG CGGCAAGTAT CCGTTTCTCG ACGCGATCCG GGTCGTGACG ATCTCGCTCG ACACGAGCGA CGCCGACCGG ATCCACCGCT TCGTTTCGTC GCTCGACGGC GTGGCGGGCG TGATGTCGTC GTCCGAGTAT TTCATCGAAG TCGCGAGCGA AGTCGCGCGG CGGCTCGGGT TGCCGACCGC GAACACCGAA GCGACGCGTG TGTGCCGCGA CAAGAAGCGT CTCGCGCGGA CGCTCGCCGA GCACGGGATA GACGTGCCGC GCACGCACGC GCTTGCGCTC GACGCCGACG CCGACGCCGA TGCCGATGCC GATGCCGATG CCGACGCCGT CGCGCTGTCC GCGCTCGACG GGCTCGCCTA TCCGGTTGTC GTCAAGCCGA GGATGGGCTC CGGCAGCGTC GGCGTGCGAC TGTGCGCGAG CGTCGACGAG GCGGCCGAGC ACTGCGCGGC GCTGCGGCGC GCGGGCACGC GCGCGGCGCT CGTGCAGGCC TATGTCGAAG GCGACGAGTA TTCGGTCGAG ACACTGACGG TCGCGCGCAG CACGCAGATC GTCGGCATCG TCAGAAAACG CCTCGGGCGC GAGCCGCACT TCGTCGAGAT CGGTCATGAC TATCCGGCGC CGTTGTCGAG CCCGCAGCGC GAGCGCATCG AGCGCACGGT GCTGCGCGCG CTCGAGGCGC TCGGCTACGC GTTCGGGCCC GCGCACACCG AGCTGCGCGT GCGCGGCGAC ACGGTCACGA TCATCGAGAT CAATCCGCGC CTCGCGGGCG GTCTGATTCC GGTGCTGCTC GGCGAGGTAT TCGACGTCGA CCTGCTCGAC CACGTGCTCG ACATGTGGCT CGGCGTGGCG GCGTTTGCCG ATCTCACCGC GAAACGCTAC GGCGCGATCC GCTTCGCGCT GCCGGCGCGC GAGGGCGTGC TGCGCGGCCC GCTCGCGCTG CCGGCCGACA TCGCCGCGCG GCCCGAGCTC CGGCATTTCC ATCCGATCGC GCAGCCGGGC GACGCGCTGC GGCTCGAAGG CAGCTTCCGC GACCGCATCG CCGCCGTCGT CTGCGCGGGC GATCATCGCG AATCGGTCGA GGCGCTCGCC GAGCGTGCGG TGGCCGGGCT GAGCATCGAC ATCGGCGACG ACGCGCGCGC AGCCGCGTTA AACGAGTCGA ACGAGTCGAA CGGCGCGAAC GGCGCGAACG GCGCGAACGC GGCGACGCCC GGCCTGCCGC CTCGCCTGCA GGCGATCGTC TACGGCGACG GCGCGAGCGA GGCCCCACTC GCGGAACTCG ACCATCTGTT CGATCTGAAC GAGGCGCATC TCGTCATGCT CGGGGCGACG CGCATCGTCG CGCCCGAACG GGTCCGGCCG CTGCTCGACG CCCATCGCCG GCTGCGGCGC GCGGGCTACG CGCCGCTGCT CGCGCGGCCG AGGCCGCGGG GCCTGTACAT GCTCGTCGAG GCATACCTGA TCGAGACGCT CGGCGAGGAT GTCGGCGGCG TGCTGCAGAC GGGCCGCTCG CGCAACGACA TCAACGCGGC GACGACGAAG CTGCATTTGC GCGACGCGAC GTCGCGCGCG TTCGACGCGC TATGGCGCTT GCGGCGCAGC CTCGTCTTCA AGGCGTCGGC GAACGTCGAC TGCGCATTTC CGATCTACAG CCAGTACCAG CCGGCGCTGC CGGGCACGCT CGCGCATCAG CTGCTCGCGT TCGACGGCGC GCTCGCGCAC GAAACCCATG CGCTGTTCGC GTTGTTCCAG CACATCGATG TCTGCCCGCT CGGCGCCGGC GCGGGCGGTG GCACGACGCT GCCGATCGAT CCGGAGTTCG TCTGCCGGCT GCTCGGCTTC GAGCAGCCGG CGCCGAACAG CCTCGATGCG GTGGCGAACC GCAGCGGCGT CGTCCATTTC CTGTCGGCGA TGAACGCGAT CGGCCTCGTG CTGTCGCGTC TCGCGCAGGA CCTGCAGATC TGGACGACGG CGGAGTTCGC GCTCGTGTCG CTGCCCGCCG CGCTGACGGG CGGCTCGTCG ATGCTGCCGC AAAAGAAAAA CCCGTTTCTC GTCGAATTCG TGAAGAGCCG CGCGGGCGTG CCGTTCGGCG CGCTCGCGAG TTGCTCGGCG GCGCTCGGCA AGACGCCGTA CACCAATTCG TTCGAGGCGG GCTCGCCGAT GAACGGGCTG ATCGCGCAGG CGTGCGCGGC GATCGAGGAC GCGGCGGCGG TCGCCGTGCT GCTGATCGAC GGGCTCGAAG CGGCGCAGGC ACGCATCGAC GCCCATCTGA GGGACACGGG CGTGGTCGCG ATGGCGGTGG CCGAATCGCT CGCCGTTCGC CGGTCGATCG ATTTTCGCTC CGCGCACACG CGGGTTGCGC AGGCGGTGCG GGACAGCGCC GCGCAGGGGC GCTCGAGCCA CGATGCGCTC GCCGCGCTCG ACCCCGATTT CGTCTCGCGC GCGCCGCTGG AGTGGGCGCG CAGCCACCGT TTCGGCGGCG GCCCGGGCGC GGCCGACCTG AACCATGGCG TCGCGCGCGC GTGCCGTGCG CTCGCCGACG ACGAGGCCGT GTTTCGCCGC AAGCAGGACG TGTGGCGGGA GGCGGAACAG ATGCGGCGGC TCGCGGCGCA GCAACTGGCG GGCGATTGA
|
Protein sequence | MTGIFVFIES NTTGTGELLV RKALQRGLTP YFLTANRGKY PFLDAIRVVT ISLDTSDADR IHRFVSSLDG VAGVMSSSEY FIEVASEVAR RLGLPTANTE ATRVCRDKKR LARTLAEHGI DVPRTHALAL DADADADADA DADADAVALS ALDGLAYPVV VKPRMGSGSV GVRLCASVDE AAEHCAALRR AGTRAALVQA YVEGDEYSVE TLTVARSTQI VGIVRKRLGR EPHFVEIGHD YPAPLSSPQR ERIERTVLRA LEALGYAFGP AHTELRVRGD TVTIIEINPR LAGGLIPVLL GEVFDVDLLD HVLDMWLGVA AFADLTAKRY GAIRFALPAR EGVLRGPLAL PADIAARPEL RHFHPIAQPG DALRLEGSFR DRIAAVVCAG DHRESVEALA ERAVAGLSID IGDDARAAAL NESNESNGAN GANGANAATP GLPPRLQAIV YGDGASEAPL AELDHLFDLN EAHLVMLGAT RIVAPERVRP LLDAHRRLRR AGYAPLLARP RPRGLYMLVE AYLIETLGED VGGVLQTGRS RNDINAATTK LHLRDATSRA FDALWRLRRS LVFKASANVD CAFPIYSQYQ PALPGTLAHQ LLAFDGALAH ETHALFALFQ HIDVCPLGAG AGGGTTLPID PEFVCRLLGF EQPAPNSLDA VANRSGVVHF LSAMNAIGLV LSRLAQDLQI WTTAEFALVS LPAALTGGSS MLPQKKNPFL VEFVKSRAGV PFGALASCSA ALGKTPYTNS FEAGSPMNGL IAQACAAIED AAAVAVLLID GLEAAQARID AHLRDTGVVA MAVAESLAVR RSIDFRSAHT RVAQAVRDSA AQGRSSHDAL AALDPDFVSR APLEWARSHR FGGGPGAADL NHGVARACRA LADDEAVFRR KQDVWREAEQ MRRLAAQQLA GD
|
| |