Gene Avin_30570 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_30570 
SymbollapG 
ID7761957 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3165878 
End bp3166930 
Gene Length1053 bp 
Protein Length350 aa 
Translation table11 
GC content68% 
IMG OID643805933 
Product4-hydroxy-2-ketovalerate aldolase 
Protein accessionYP_002800197 
Protein GI226945124 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR03217] 4-hydroxy-2-oxovalerate aldolase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00100274 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATGAGCC AGTCCGACAG CCTGAAAGGC CGCAAGGTCG TTTTCCATGA CATGTGCCTG 
CGCGACGGCA TGCACGCCAA GCGCGAGCAG ATCGGCGTCG AGCAGATGAT CGCGGTCGCC
ACCGCCCTCG ACGCCGCCGG CGTGCCCTAC ATCCAGGTCA CCCATGGCGC CGGCCTCGGC
GGCAACTCGC TGCAGCACGG CTTTGCCCCG CACAGCAACG AGGAATACAT CGGCGCGGTG
GCCGCGAAGA TGAAACAGGC CAAGGTTTCG GTGCTGCTGA TTCCGGGGCT CGGCACCATG
AAGGAGCTGC AGTCGGCCTT CGACTGCGGC GCGCGCAGCG TGCACGTCGC CACCCACTGC
ACTGAGGCGG ACACCTCGCC GCAGCACATC GCCTTCGCCC GCAAGCTGGG CATGGACACC
TCCGGCTTCC TGATGATGTC ACACCTCAAC GACCCGGCCG GCATCGCCCG GCAGGGCAAG
CTGATGGAGT CCTACGGCGC GCAGACCGTC TACGTCACCG ACTCGGCCGG CTACATGTTG
CCTGAGGACG TGAAGGCGCG CGTCGGCGCG CTGCGCGAGG TGCTGGCGCC GGAAACCGGG
ATCGGTTTCC ACGGCCACCA CAACCTGGGC ATGGGCATCG CCAACTCCAT CGCCGCCATC
GAGGCCGGCG CCAGCCGCAT CGACGGTTCG GTGGCGGGCC TCGGCGCCGG CGCCGGCAAC
ACGCCGCTGG AGGTGTTCGC CGCGGTGTGC GAGCGCATGG GCATCGACAC CGGCGTCGAT
CTGTTCAGGC TGATGGACGT GGCCGAGGAC ATCATCGTGC CGATGATGGA GCATGTGGTG
CGCGTCGACC GCGAGTCGCT GACCCTGGGC TACGCCGGCG TCTACTCGAC CTTCCTGCTG
CATTCCAAAC GCGCCGCCGA GCGCTTCGGC GTGCCGGCGC GCGACATCCT GGTCGAGCTG
GGCCGCAAGA AGATGATCGG CGGCCAGGAG GACATGATCC TCGACACCGC GATGAGCATG
GCCAAGGCGC GCGGGCTGCT GAAGAGCGCC TGA
 
Protein sequence
MMSQSDSLKG RKVVFHDMCL RDGMHAKREQ IGVEQMIAVA TALDAAGVPY IQVTHGAGLG 
GNSLQHGFAP HSNEEYIGAV AAKMKQAKVS VLLIPGLGTM KELQSAFDCG ARSVHVATHC
TEADTSPQHI AFARKLGMDT SGFLMMSHLN DPAGIARQGK LMESYGAQTV YVTDSAGYML
PEDVKARVGA LREVLAPETG IGFHGHHNLG MGIANSIAAI EAGASRIDGS VAGLGAGAGN
TPLEVFAAVC ERMGIDTGVD LFRLMDVAED IIVPMMEHVV RVDRESLTLG YAGVYSTFLL
HSKRAAERFG VPARDILVEL GRKKMIGGQE DMILDTAMSM AKARGLLKSA