Gene Gdia_0084 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGdia_0084 
Symbol 
ID6973473 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGluconacetobacter diazotrophicus PAl 5 
KingdomBacteria 
Replicon accessionNC_011365 
Strand
Start bp96337 
End bp97566 
Gene Length1230 bp 
Protein Length409 aa 
Translation table11 
GC content65% 
IMG OID643389615 
Productargininosuccinate synthase 
Protein accessionYP_002274499 
Protein GI209542270 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0137] Argininosuccinate synthase 
TIGRFAM ID[TIGR00032] argininosuccinate synthase 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.0882429 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones33 
Fosmid unclonability p-value0.0155281 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCCGTAA AGGACGTCAA GAAGGTCGTG CTCGCCTATT CCGGCGGGCT CGATACATCG 
GTAATTCTGC GCTGGCTGCA GACCACCTAC GGGTGCGAGG TCGTCACCTT CACCGCCGAC
CTCGGCCAGG GCGAGGAACT GGAACCCGCC CGCAAGAAGG CCGAAATGTT CGGCGTGAAG
GAAATCTTCG TCGAGGACCT GCGCGAGACC TTCGTCAAGG ACTTCGTCTT CCCGATGTTC
CGCGCCAACA CGCTGTATGA AGGCCAGTAC CTGCTGGGGA CCTCCATCGC GCGTCCGCTG
ATCGCCCAGC GCCAGATCGA AATCGCCGAG GCCGTGGGTG CCGACGCCGT GGCCCATGGC
GCGACGGGCA AGGGCAACGA CCAGGTGCGC TTCGAACTGG CCTATTACGC GCTGAAGCCC
GACGTGACGG TCATCTCCCC CTGGCGGGAA TGGGACCTGA CCTCGCGCAC GCGGCTGCTG
GCCTTCGCCG AGGAACATCA GATCCCTATC GCGAAGGACA AGCGCGGCGA GGCCCCGTTC
TCGGTCGATG CCAACCTGCT GCACTCCTCG TCCGAAGGCA AGCTGCTGGA AGACCCTGCC
GTCGCACCCG ATGAAATCGT CTTCCAGCGC ACGATCTCGC CCGAGGCCGC GCCGGACGTC
GCGACCGAAA TCGCGATCGA TTTCGTCTCG GGCGACCCGG TGGCGCTGAA CGGCGTCACC
CTGTCCCCCG CCACGCTGCT GGCGCGGCTG AACGAACTGG GCAAGGCCAA CGGGATCGGG
CGGCTGGACC TGGTGGAAAA CCGCTTCGTC GGCATGAAGT CGCGCGGCAT CTACGAAACG
CCGGGCGGCA CCATCCTGCT GGCGGCGCAT CGCAGCATGG AAACCATCAC GCTGGACCGC
GAGGCCGGGC ACCTGAAGGA CAGCCTGATG CCCCGCTATG CCGAACTGAT CTATAACGGC
TTCTGGTTCT CGCCCGAGCG GCGCATGCTC CAGGCCCTGA TCGACGAAAG CCAGCATTCC
GTGACCGGAC GCGTGCGGCT GAAGCTGTAC AAGGGCAATG TGATCTGCGT CGGGCGGGAA
AGCCCCCATA GCCTGTACGA TACCCGCGTT GTGACATTCG AAGACGACGA AGGGGCGTAT
AATCAAAGCG ATGCACTGGG CTTCATCAAG CTGAACGCCC TGCGTCTGCG TCTGGGCGCG
CAGATCGGAC GGCGCGGCGG CGCGCTGTAG
 
Protein sequence
MAVKDVKKVV LAYSGGLDTS VILRWLQTTY GCEVVTFTAD LGQGEELEPA RKKAEMFGVK 
EIFVEDLRET FVKDFVFPMF RANTLYEGQY LLGTSIARPL IAQRQIEIAE AVGADAVAHG
ATGKGNDQVR FELAYYALKP DVTVISPWRE WDLTSRTRLL AFAEEHQIPI AKDKRGEAPF
SVDANLLHSS SEGKLLEDPA VAPDEIVFQR TISPEAAPDV ATEIAIDFVS GDPVALNGVT
LSPATLLARL NELGKANGIG RLDLVENRFV GMKSRGIYET PGGTILLAAH RSMETITLDR
EAGHLKDSLM PRYAELIYNG FWFSPERRML QALIDESQHS VTGRVRLKLY KGNVICVGRE
SPHSLYDTRV VTFEDDEGAY NQSDALGFIK LNALRLRLGA QIGRRGGAL