Gene BURPS1106A_2017 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_2017 
SymbolargH 
ID4899483 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1980152 
End bp1981552 
Gene Length1401 bp 
Protein Length466 aa 
Translation table11 
GC content68% 
IMG OID640135247 
Productargininosuccinate lyase 
Protein accessionYP_001066282 
Protein GI126452580 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.178191 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGCAATC CGATGGCGAA TCCGTGGGCC GGGCGCTTCA GCCAGACGAT GACCGACAGC 
CTGGTGGCGT TCAACACGAG CCTGCCGCTC GAGACGCGGC TCTTCGAGGC CGACATCGAC
GGCACGGCCG CGCACGTCGA GATGCTGCAT GCGACGGGCC TGCTGGAGAC GGCGGAGCAC
GAGGCGCTCG CGCGCGCGCT CGACGAGATC CGCGCCGCGT GGCGCGCGGG CGAGATCCGG
CTGTCGCCCG CGCTCGAGGA CATCCACATG AACCTCGAGA CGCTGCTGGT CGACAAGCTC
GGCGAGCTCG GCAAGAAAAC GCATACCGCG CGCAGCCGCA ACGATCAGCA GGCGAGCGCG
CAGCGCCTCT ATTTCATGCG CTCGACGCGC GAGCTGGTGG ACGCGATCGA CGCGCTGCAG
CGCGCGATCC TCGAGCACGG CGAGCGCCAC GACGCGCTCG TGATGCCATC GTACACGCAC
CTGCAGCGCG CCGAATTCAC GTATTACGCG CACTGGCTCG CGACTTACGT GGTGATGCTC
GAGCGCGACC GCAGCCGCTT CGTCGACGCG CTCGCGCGCG CCGACCAGTG CCCGCTCGGC
GCCTGTGCGT CGACCGGCAC GAGCCTGCCG ATCGACCGCC GGCGCTCGGC GTCGCGGCTG
GGCTTCAGGG AGCCCACGCT GCACAGCATC GATTCGGTGT CCGACCGCGA CTACCTCGTC
GAATTCTGCT CGCACGCGGC GAATCTGATG ATCCATCTGT CGCGGCTGTC GGAGGAGCTC
ATTTCGTTCA CGAGCCAGGA GTTCGGCTTC ATCGCGCTCG CGGACGGCTA CTGCACCGGC
AGCTCGATCA TGCCGCAGAA GAAGAACCCC GACGTGCCGG AGCTCGTGCG CGGCAAGGCG
GCATCGGTGA TCGGCAACGC GATGAGCCTG ATGGCGCTCC TGAAGGCGCT GCCGCTCGGC
TACAACAAGG ATCTGCAGGA GGACAAGACC GCGTGGTTCG CCGCGCTCGA CAACTGCATG
TCGTCGCTCG CGATCCTGAC CGAGCTGGTT CGCACGATGG CGCCCGTGCC CGACAGGATG
CGCCAGGCGA CGCTCGGCGG GCACATCATC GCCACCGAGT ACGCGAACTA CCTGGTCCGC
AAGGGCATGC CGTTTCGCGA AGCGCATCGG GTCGTCGGCG AGCTGGTGAA GACGGCGGAC
GCACGCGGCG TCGACGTGTC GGCGCTGCCC CAGGCGGCAT TCGCCGAGGC GAGCCCGCTG
TTCGGCGACG ACATCGGCCG CGTCACCGTC GAGGACATGG TGTCGCGCAA GAACTCGTAC
GGCTCGTGCG GCGACCAGGC GCTCGGCGAG CTGCTCGCGC AGCTTCGCTC GATGCTGGAC
GCGCATCGTC TCGGCCGATG A
 
Protein sequence
MSNPMANPWA GRFSQTMTDS LVAFNTSLPL ETRLFEADID GTAAHVEMLH ATGLLETAEH 
EALARALDEI RAAWRAGEIR LSPALEDIHM NLETLLVDKL GELGKKTHTA RSRNDQQASA
QRLYFMRSTR ELVDAIDALQ RAILEHGERH DALVMPSYTH LQRAEFTYYA HWLATYVVML
ERDRSRFVDA LARADQCPLG ACASTGTSLP IDRRRSASRL GFREPTLHSI DSVSDRDYLV
EFCSHAANLM IHLSRLSEEL ISFTSQEFGF IALADGYCTG SSIMPQKKNP DVPELVRGKA
ASVIGNAMSL MALLKALPLG YNKDLQEDKT AWFAALDNCM SSLAILTELV RTMAPVPDRM
RQATLGGHII ATEYANYLVR KGMPFREAHR VVGELVKTAD ARGVDVSALP QAAFAEASPL
FGDDIGRVTV EDMVSRKNSY GSCGDQALGE LLAQLRSMLD AHRLGR