Gene BURPS668_1061 details

Gene Information

Locus tag: BURPS668_1061
Symbol: argH
ID: 4881738
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Burkholderia pseudomallei 668
Kingdom: Bacteria
Replicon accession: NC_009074
Strand:
Start bp: 1036746
End bp: 1038155
Gene Length: 1410 bp
Protein Length: 469 aa
Translation table: 11
GC content: 68%
IMG OID: 640126989
Product: argininosuccinate lyase
Protein accession: YP_001058111
Protein GI: 126439172
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0165] Argininosuccinate lyase
TIGRFAM ID: [TIGR00838] argininosuccinate lyase
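
The coordinate and length fields above can be cross-checked against one another. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the gene length counts the terminating stop codon. The function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1036746
end_bp = 1038155
gene_length_bp = 1410
protein_length_aa = 469


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


print(span_length(start_bp, end_bp) == gene_length_bp)
print(expected_protein_length(gene_length_bp) == protein_length_aa)
```

Under those assumptions both checks agree with the record: 1038155 - 1036746 + 1 = 1410 bp, and 1410 / 3 - 1 = 469 aa.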


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGACGTCCC AACTGCACAA AAAGGGCGAG GCCTGGTCGG CCCGCTTTTC GGAACCGATG 
TCCGAGCTCG TCAAGCGCTA CACGTCGTCG GTCTTCTTCG ACAAGCGGCT CGCGCTCGTC
GACATCGCCG GCTCGCTCGC GCACGCGGGC ATGCTCGCCG CGCAGAAGAT CATCAGCGCC
GACGACCTCG CCGCGATCGA GCGCGGGATG GCGCAAATCA AGGGCGAGAT CGAGCGCGGC
GAATTCGAAT GGCAGCTCGA TCTCGAAGAC GTCCACCTGA ACATCGAGGC GCGCCTGACC
GCGCTCATCG GCGATGCGGG CAAGCGCCTG CACACGGGCC GCTCGCGCAA CGATCAGGTC
GCGACCGACA TCCGCCTGTG GCTGCGCGGC GAGATCGACC GGATCGGCGG CCTGCTGAAC
GACCTGCGCG GCGCGCTGAT CGATCTCGCC GAACAGAACG CGGACACGAT CCTGCCGGGC
TTCACGCACC TGCAGGTCGC GCAGCCTGTC ACGTTCGGCC ATCACCTGCT TGCCTACGTC
GAGATGTTCT CGCGCGACGC CGGGCGCATG CGCGACTGCC GCGCGCGCGT GAACCGCCTG
CCGCTCGGCG CGGCGGCGCT CGCGGGCACC AGCTATCCGA TCGACCGCCA CGCGGTGGCG
AAGACGCTCG GCTTCGACGG CATCTGCGCG AACTCGCTCG ACGCGGTGTC CGATCGCGAC
TTCGCGATCG AATTCACGGC CGCGGCCGCG CTCGTGATGA CGCACGTGTC GCGCTTCTCG
GAAGAACTCG TGCTGTGGAT GAGCCCGCGC GTGGGCTTCA TCGACATCGC CGACCGCTTC
TGCACCGGCA GCTCGATCAT GCCGCAGAAG AAGAACCCGG ACGTGCCCGA GCTCGCGCGC
GGCAAGACGG GCCGCGTGAA CGGCCACCTG ATGGCGCTGC TCACGCTGAT GAAGGGCCAG
CCGCTCGCGT ACAACAAGGA CAATCAGGAA GACAAGGAAC CGCTGTTCGA CACGGTCGAC
ACCGTCGCCG ACACGCTGCG GATCTTCGCG GAGATGGTCG CGGGCATCAC GGTGAAGCCG
GACGCGATGC GCGCGGCCGC GCTGCAGGGC TTCTCGACCG CGACGGATCT CGCGGACTAC
CTGGTCAAGC GCGGGCTGCC GTTCCGCGAC GCGCACGAGG CGGTCGCGCA CGCGGTGAAG
GTCTGCGACG CGCGCGGCAT CGACCTCGCG GATCTGACGC TCGACGAAAT GAAGCAGGAA
CTGCCGAACG TCGCGCATCT GATCGGCGAA GACGTGTTCG ACTATCTGAC GCTCGAAGGC
TCGGTCGCGA GCCGCAATCA TCCGGGCGGC ACCGCGCCGG ACCAGGTGCG CGCGGCGGCG
AAGGCCGCGC GCGCGGCGCT CGGCCAGTAA
 
Protein sequence
MTSQLHKKGE AWSARFSEPM SELVKRYTSS VFFDKRLALV DIAGSLAHAG MLAAQKIISA 
DDLAAIERGM AQIKGEIERG EFEWQLDLED VHLNIEARLT ALIGDAGKRL HTGRSRNDQV
ATDIRLWLRG EIDRIGGLLN DLRGALIDLA EQNADTILPG FTHLQVAQPV TFGHHLLAYV
EMFSRDAGRM RDCRARVNRL PLGAAALAGT SYPIDRHAVA KTLGFDGICA NSLDAVSDRD
FAIEFTAAAA LVMTHVSRFS EELVLWMSPR VGFIDIADRF CTGSSIMPQK KNPDVPELAR
GKTGRVNGHL MALLTLMKGQ PLAYNKDNQE DKEPLFDTVD TVADTLRIFA EMVAGITVKP
DAMRAAALQG FSTATDLADY LVKRGLPFRD AHEAVAHAVK VCDARGIDLA DLTLDEMKQE
LPNVAHLIGE DVFDYLTLEG SVASRNHPGG TAPDQVRAAA KAARAALGQ
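
The sequences in this record are printed in blocks of ten characters, so they need to be stripped of whitespace before use. The following is a minimal sketch, not the database's own tooling, of how the gene sequence could be cleaned, its GC fraction computed, and its translation compared with the protein sequence. It relies on the fact that translation table 11 uses the same codon-to-amino-acid assignments as the standard code (the two differ in the permitted start codons). All names (`clean`, `gc_fraction`, `translate`, `first_blocks`) are hypothetical.

```python
# Standard codon assignments, which translation table 11 shares.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the result."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First three blocks of the gene sequence above (30 bases, 10 codons).
first_blocks = "ATGACGTCCC AACTGCACAA AAAGGGCGAG"
print(translate(first_blocks))  # MTSQLHKKGE, the first block of the protein sequence
print(gc_fraction(first_blocks))
```

Passing the full gene sequence to `translate` and `gc_fraction` allows the result to be compared with the Protein sequence and GC content fields of the record.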