Gene BURPS1106A_1067 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagBURPS1106A_1067 
SymbolargH 
ID4900675 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameBurkholderia pseudomallei 1106a 
KingdomBacteria 
Replicon accessionNC_009076 
Strand
Start bp1045757 
End bp1047166 
Gene Length1410 bp 
Protein Length469 aa 
Translation table11 
GC content68% 
IMG OID640134297 
Productargininosuccinate lyase 
Protein accessionYP_001065347 
Protein GI126452237 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.447898 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACGTCCC AACTGCACAA AAAGGGCGAG GCCTGGTCGG CCCGCTTTTC GGAACCGATG 
TCCGAGCTCG TCAAGCGCTA CACGTCGTCG GTCTTCTTCG ACAAGCGGCT CGCGCTCGTC
GACATCGCCG GCTCGCTCGC GCACGCGGGC ATGCTCGCCG CGCAGAAGAT CATCAGCGCC
GACGACCTCG CCGCGATCGA GCGCGGGATG GCGCAAATCA AGGGCGAGAT CGAGCGCGGC
GAATTCGAAT GGCAGCTCGA TCTCGAAGAC GTCCACCTGA ACATCGAGGC GCGCCTGACC
GCGCTCATCG GCGATGCGGG CAAGCGCCTG CACACGGGCC GCTCGCGCAA CGATCAGGTC
GCGACCGACA TCCGCCTGTG GCTGCGCGGC GAGATCGACC GGATCGGCGG CCTGCTGAAC
GACCTGCGCG GCGCGCTGAT CGATCTCGCC GAACAGAACG CGGACACGAT CCTGCCGGGC
TTCACGCACC TGCAGGTCGC GCAGCCTGTC ACGTTCGGCC ATCACCTGCT TGCCTACGTC
GAGATGTTCT CGCGCGACGC CGAGCGCATG CGCGACTGCC GCGCGCGCGT GAACCGCCTG
CCGCTCGGCG CGGCGGCGCT CGCGGGCACC AGCTATCCGA TCGACCGCCA CGCGGTGGCG
AAGACGCTCG GCTTCGACGG CATCTGCGCG AACTCGCTCG ACGCGGTGTC CGATCGCGAC
TTCGCGATCG AATTCACGGC CGCGGCCGCG CTCGTGATGA CGCACGTGTC GCGCTTCTCG
GAAGAACTCG TGCTGTGGAT GAGCCCGCGC GTGGGCTTCA TCGACATCGC CGACCGCTTC
TGCACCGGCA GCTCGATCAT GCCGCAGAAG AAGAACCCGG ACGTGCCCGA GCTCGCGCGC
GGCAAGACGG GCCGCGTGAA CGGCCACCTG ATGGCGCTGC TCACGCTGAT GAAGGGCCAG
CCGCTCGCGT ACAACAAGGA CAATCAGGAA GACAAGGAAC CGCTGTTCGA CACGGTCGAC
ACCGTCGCCG ACACGCTGCG GATCTTCGCG GAGATGGTCG CGGGCATCAC GGTGAAGCCG
GACGCGATGC GCGCGGCCGC ACTGCAGGGC TTCTCGACCG CGACGGATCT CGCGGACTAC
CTGGTCAAGC GCGGGCTGCC GTTCCGCGAC GCGCACGAGG CGGTCGCGCA CGCGGTGAAG
GTCTGCGACG CGCGCGGCAT CGACCTCGCG GATCTGACGC TCGACGAAAT GAAGCAGGAA
CTGCCGAACG TCGCGCATCT GATCGGCGAG GACGTGTTCG ACTATCTGAC GCTCGAAGGC
TCGGTCGCGA GCCGCAATCA TCCGGGCGGC ACCGCGCCGG ACCAGGTGCG CGCGGCGGCG
AAGGCCGCGC GCGCGGCGCT CGGCCAGTAG
 
Protein sequence
MTSQLHKKGE AWSARFSEPM SELVKRYTSS VFFDKRLALV DIAGSLAHAG MLAAQKIISA 
DDLAAIERGM AQIKGEIERG EFEWQLDLED VHLNIEARLT ALIGDAGKRL HTGRSRNDQV
ATDIRLWLRG EIDRIGGLLN DLRGALIDLA EQNADTILPG FTHLQVAQPV TFGHHLLAYV
EMFSRDAERM RDCRARVNRL PLGAAALAGT SYPIDRHAVA KTLGFDGICA NSLDAVSDRD
FAIEFTAAAA LVMTHVSRFS EELVLWMSPR VGFIDIADRF CTGSSIMPQK KNPDVPELAR
GKTGRVNGHL MALLTLMKGQ PLAYNKDNQE DKEPLFDTVD TVADTLRIFA EMVAGITVKP
DAMRAAALQG FSTATDLADY LVKRGLPFRD AHEAVAHAVK VCDARGIDLA DLTLDEMKQE
LPNVAHLIGE DVFDYLTLEG SVASRNHPGG TAPDQVRAAA KAARAALGQ