Gene GWCH70_2702 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGWCH70_2702 
Symbol 
ID7976522 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacillus sp. WCH70 
KingdomBacteria 
Replicon accessionNC_012793 
Strand
Start bp2734838 
End bp2736217 
Gene Length1380 bp 
Protein Length459 aa 
Translation table11 
GC content45% 
IMG OID644799501 
Productargininosuccinate lyase 
Protein accessionYP_002950660 
Protein GI239828036 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0165] Argininosuccinate lyase 
TIGRFAM ID[TIGR00838] argininosuccinate lyase 


Plasmid Coverage information

Num covering plasmid clones22 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAAGC TTTGGGGAGG ACGTTTTAAG AAAACAGCGG AAGAGTGGGT CGACGAGTTT 
GGAGCGTCGA TCCCATTCGA CCAAGAGTTA GTCGAAGAAG ACATAGAGGG CAGCATCGCT
CATGCAACGA TGCTTGGCAA ATGCGGAATT TTGCCAAGTG AGGATGTGGA GAAAATCAAG
GCCGGACTGT TTACGCTGCT AGAAAAAGCG AAACAAGGCA AGCTCGAATT TTCTGTTGCC
TACGAGGATA TTCATTTAAA TATTGAAAAA ATGCTTATTG ATGAAATTGG CCCTGTCGGT
GGGAAACTGC ACACAGGAAG AAGCCGCAAC GACCAAGTGG CAACCGATAT GCATTTATAT
TTGCGCAAAC GCGTAACGGA AATCATCGCG CTTATTCAAG AATTGCAAAA GGTGCTTGTC
GAAAAAGCGG AGGAGCACGT GGAAACAATT GTACCGGGAT ATACACATTT GCAGCGGGCG
CAGCCGATTT CGTTTGCCCA TCATTTGCTT GCGTATTTTT GGATGCTTGA ACGCGACCGC
GAGCGATTCC GTGAATCATT AAAGCGCATT AATAAATCAC CGCTCGGGGC GGGAGCGCTC
GCTGGAACAA CGTTTCCAAT TGACCGTCAT TTGACCGCGG AACTTCTTGG CTTTGACGGC
ATCTACGAAA ACAGTATCGA TGCGGTGAGC GACCGTGATT TTATCATTGA ATTTTTAAGC
AACAGCTCCA TGCTGATGAT GCATTTATCA CGTTTTTGCG AGGAGCTTAT TCTTTGGTCA
AGCCAAGAGT TTCAATTTAT TGAAATTGAC GATGCGTTCG CCACAGGAAG CAGCATTATG
CCGCAAAAGA AAAATCCGGA CATGGCAGAA TTAATTCGCG GAAAAACGGG ACGGGTATAC
GGAAATTTAT TAGCGCTTTT GACAGTGATG AAAGGAACGC CGCTTGCGTA CAACAAAGAT
ATGCAAGAAG ACAAAGAAGG CATGTTCGAT ACGGTGAAAA CGGTCACTGG ATCGCTGAAA
ATTTTCGCTG GTATGATTAA AACGATGAAA GTAAACGTCG ATGTTATGGA AAAAGCGACA
AAACAAGATT TTTCGAATGC AACGGAGCTT GCCGACTACT TAGCCAACAA AGGGGTACCA
TTCCGCGAGG CGCATGAGAT TGTTGGCAAA CTCGTGCTAA TTTGTATTGA AAAAGGCGTA
TTTTTAGCCG ATTTGCCGCT GGATGTCTAT AAGGAAGCGT CACCGTTGTT TGAAGAAGAT
ATATATGAAG CGCTTAAGCC ATACACCGCT GTTAACCGCC GCAATAGCGC GGGCGGAACA
GGTTTTTCCG AAGTAAGAAA AGCGTTGGAA AAAGCTAAAA AAATAGTGAA CACTCCGTAG
 
Protein sequence
MKKLWGGRFK KTAEEWVDEF GASIPFDQEL VEEDIEGSIA HATMLGKCGI LPSEDVEKIK 
AGLFTLLEKA KQGKLEFSVA YEDIHLNIEK MLIDEIGPVG GKLHTGRSRN DQVATDMHLY
LRKRVTEIIA LIQELQKVLV EKAEEHVETI VPGYTHLQRA QPISFAHHLL AYFWMLERDR
ERFRESLKRI NKSPLGAGAL AGTTFPIDRH LTAELLGFDG IYENSIDAVS DRDFIIEFLS
NSSMLMMHLS RFCEELILWS SQEFQFIEID DAFATGSSIM PQKKNPDMAE LIRGKTGRVY
GNLLALLTVM KGTPLAYNKD MQEDKEGMFD TVKTVTGSLK IFAGMIKTMK VNVDVMEKAT
KQDFSNATEL ADYLANKGVP FREAHEIVGK LVLICIEKGV FLADLPLDVY KEASPLFEED
IYEALKPYTA VNRRNSAGGT GFSEVRKALE KAKKIVNTP