Gene Avin_40120 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_40120 
Symbol 
ID7762898 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp4066194 
End bp4067867 
Gene Length1674 bp 
Protein Length557 aa 
Translation table11 
GC content68% 
IMG OID643806871 
Product2-isopropylmalate synthase 
Protein accessionYP_002801123 
Protein GI226946050 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0119] Isopropylmalate/homocitrate/citramalate synthases 
TIGRFAM ID[TIGR00970] 2-isopropylmalate synthase, yeast type 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.397785 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCATGC TCAAAGATCC GTCGCAGAAA TACCGCCCCT TCGCGCCGAT CGCCCTGCGC 
GACCGCACCT GGCCGGACCG GGTGATCGAC AAGGCGCCGC TCTGGCTGAG TACCGATTTG
CGCGACGGCA ACCAGTCGCT GATCGAGCCG ATGGATGCCG CGAAGAAGAT GCGCTTCTTC
AAGACCCTGG TGCAGGTCGG CCTGAAAGAG ATCGAGGTGG GCTTCCCGTC CGCCTCGCAG
ACCGATTTCG ACTTCGTCCG CGAACTGATC GAGGGCGGCC ACATCCCCGA CGACGTAACC
ATCCAGGTGC TGACCCAGGC CCGCGACGAC CTCATCGAGC GGACCTTCGA ATCGCTGAAG
GGTGCGAAGA AGGCCATCGT CCACTACTAC AACGCCTGCG CGCCGAGCTT CCGGCGCATC
GTGTTCGACC AGGACAAGGA AGGCGTCAAG CGGATCGCCG TCGCCGCCGG CCGGACCATC
AAGCGCCTGG CCGCCGCCGC GCCGGAAACC CGGTGGGGCT TCGAGTATTC CCCCGAGGTG
TTCAGCTCCA CCGAGAGCGA TTTCGCCGTC GAGGTGTGCA ACGCGGTGGT CGAGGTGTTC
CAGCCGACCC CGGCCAACCG CCTGATCCTC AACCTGCCGG CCACCATCGA ATGCGCCACG
CCGAACCACT ACGCCGACCA GATCGAGTGG TTCTGCCGGC ATGTCGACAG GCGCGACAGC
GTCATCGTCA GTCTGCACAC CCACAACGAC CGCGGCACCG GCGTGGCCGC CAGCGAGCTG
GGCCTGATGG CCGGCGCCGA CCGCGTCGAG GGCTGCCTGT TCGGCAACGG CGAGCGTACC
GGCAACGTCG ACCTGGTGAC CCTGGCGCTG AACCTCTACA CCCAGGGCGT CGACCCCGGG
CTGGACTTCT CCGACATCGA CGCGGTGCGC AAGGTGGTCG AGGAATGCAA CCAGTTGCCG
GTACACCCGC GCCATCCCTA CGTCGGCGAC CTGGTGCACA CCGCCTTCTC CGGCTCGCAC
CAGGACGCGA TCCGCAAGGG CTTCGCCCAG CAGGACCCGG AGGGCGTCTG GGAGGTGCCC
TATCTGCCGA TCGACCCGGC CGACATCGGC CGCAGCTACG AGGCGGTGAT CCGCGTCAAC
AGCCAGTCGG GCAAGGGCGG CATCGCCTAC CTGCTCGAAC AGGAGTACGG CATCAGCCTG
CCGCGGCGCA TGCAGATCGA GTTCAGCCAG GTGGTGCAGA AGGAGACCGA TCGCCTCGGC
CTGGAGATGA GCGCCGCGCA GATCCACGCG CTGCTCGAAG CCGAGTACCT GCGCGCCGAG
ACGCCCTACG CCTTGAAGGG CCATCGCCTG CAGGAGGAGA ACGGTACCTG CGCGCTGGAC
GTGGAAGTCT TCGACAAGGG CGAGAGCCGC CATTGGCGCG GCATCGGCAA GGGCCCGCTG
GAGGCGCTGG TCGCCTGCCT GCCGGTCCGC GTGGAGATCA TGGACTACCA CGAGCACGCC
ATCGGCGCCG GCAGCCATGC CAGGGCCGCG GCCTACATCG AGCTGCGCCT CGACGGCCAG
CGTTCGCTGC ACGGCCTGGG CATCGACGAG AACCTGACCA CGGCGAGCAT CCGCGCCCTG
TTCAGTGCCC TCAACCGCGC CCTCGGCCAG CAGGCGTCGA TCCGCGCGGC CTGA
 
Protein sequence
MPMLKDPSQK YRPFAPIALR DRTWPDRVID KAPLWLSTDL RDGNQSLIEP MDAAKKMRFF 
KTLVQVGLKE IEVGFPSASQ TDFDFVRELI EGGHIPDDVT IQVLTQARDD LIERTFESLK
GAKKAIVHYY NACAPSFRRI VFDQDKEGVK RIAVAAGRTI KRLAAAAPET RWGFEYSPEV
FSSTESDFAV EVCNAVVEVF QPTPANRLIL NLPATIECAT PNHYADQIEW FCRHVDRRDS
VIVSLHTHND RGTGVAASEL GLMAGADRVE GCLFGNGERT GNVDLVTLAL NLYTQGVDPG
LDFSDIDAVR KVVEECNQLP VHPRHPYVGD LVHTAFSGSH QDAIRKGFAQ QDPEGVWEVP
YLPIDPADIG RSYEAVIRVN SQSGKGGIAY LLEQEYGISL PRRMQIEFSQ VVQKETDRLG
LEMSAAQIHA LLEAEYLRAE TPYALKGHRL QEENGTCALD VEVFDKGESR HWRGIGKGPL
EALVACLPVR VEIMDYHEHA IGAGSHARAA AYIELRLDGQ RSLHGLGIDE NLTTASIRAL
FSALNRALGQ QASIRAA