Gene Avin_50100 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_50100 
SymboliolD 
ID7763861 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp5076837 
End bp5078774 
Gene Length1938 bp 
Protein Length645 aa 
Translation table11 
GC content67% 
IMG OID643807841 
Productmyo-inositol catabolism protein IolD 
Protein accessionYP_002802075 
Protein GI226947002 
COG category[E] Amino acid transport and metabolism 
COG ID[COG3962] Acetolactate synthase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.443282 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAGTACGA TACGACTGAC CATGGCCCAG GCCCTGGTGA AATTCCTCGA CAACCAATAC 
GTCAGCGTCG ACGGTGTCGA GAGCAAGTTC GTCAAAGGCA TCTTCACCAT TTTCGGCCAC
GGCAACGTGC TGGGCCTGGG GCAGGCGCTG GAGCAGGATC CGGGCGAGCT GATCGTCCAC
CAGGGCCGCA ACGAGCAGGG CATGGTCCAC GCCGCCATCG GCTTCGCCAA GCAGAAGATG
CGCCGGCAGA TCTACGCCTG CACCTCCTCG GTCGGCCCCG GCGCGGCGAA CATGATCACC
GCCGCCGCCA CCGCCACCGC CAACCGCATT CCCGTGCTGC TGCTGCCGGG CGACGTCTAC
GCCACCCGCC AGCCCGACCC GGTGCTGCAG CAGATCGAGC AGAGCCACGA CCTGTCGATC
AGCACCAACG ACGCCTTCCG GGCGGTCAGC AAGTACTGGG ACCGCGTCAG CCGCCCCGAA
CAACTGATGA GCGCGGCGAT CAACGCCATG CGCGTGCTCA CCGATCCGGC CGAGACCGGC
GCGGTGACCC TGGCGCTGCC GCAGGACGTG CAAGGCGAGG CCTACGACTA CCCGGACTAC
TTCTTCGCCA AACGGGTGCA CCGCATCGAC CGCCGCCCGG CCACCGCGGC CATGCTGGCC
GACGCCGTGG CGCTGCTCAA GGGCAAGCGC AAGCCGCTGC TGATCTGCGG CGGCGGGGTG
AAATACTCCG GCGCCGCCGA GGCGTTGCAG CGTTTCGCCG AGCGTTTCGA GATTCCCTTC
GCCGAGACCC AGGCCGGCAA GAGCGCCATC GTCTCCGCCC ACCCGCTGAA CGTCGGCGGC
ATCGGCGAGA CCGGCTGCCT GGCGGCTAAC CTGCTGGCCA GGGAGGCCGA CCTGGTGATC
GGCGTCGGTA CCCGCTACAC CGACTTCACC ACCGCCTCCA AGTGGATCTT CCAGAACCCC
GAAGTGGCCT TCCTCAACCT CAACGTCAGC GCCTTCGACG CCTACAAGCT CGACGCCGTG
CAGGTGGTGG CCGACGCCCG GGCCGGCCTG GAGGCGCTCG GCGAAGCCCT CGGCCACGGC
GGCTACCGTG CCCAGTGGGG CGAGGCGACG GCGCAGGCCA AGGCCAGGCT GAAGGCGGAA
GTCGACCGCG TCTACGCCGT GGAATACAGC GGCGAAGGCT TCGTCCCGGA GATCGACGAC
CACCTGCCGC GCAGCGTGCT GGAAGAGTTC ATCGAACTGA CCGGCTCCAG CCTGACCCAG
AGCCAGGTGC TCGGCGTGCT CAACCGGACC CTGGCCGACG ACGCCATCAT CGTCGGCGCC
TCCGGCAGCC TGCCGGGCGA CCTGCAGCGG ATCTGGCGTT GCAAGGGGAC CGACACCTAC
CACATGGAGT ACGGCTACTC CTGCATGGGC TACGAGGTGA ACGCCGCCCT CGGGGTGAAA
ATGGCCGAGC CCGAGCGCGA GGTCTACACC CTGGTCGGCG ACGGCTCCTA CATGATGCTG
CACTCGGAGC TGCCCACCTC CATCCAGGAG CGCCGCAAGA TCAACATCGT CCTCTTGGAC
AACATGACCT TCGGCTGCAT CAACAACCTG CAGATGGAAC ACGGCATGGA CAGCTTCGGC
ACCGAGTTCC GCTACCGCAA CCCGGAGACC GGCAAGCTCG ACGGCGGCTT CGTGCCGGTC
GACTTCGCCA TGAGCGCCGC GGCCTACGGC TGCAAGACCT ACCGGGTGAA GACCCTCGAC
GAGCTGCATG CGGCGCTGGA AGACGCCCGC CGGCAGAGCG TTTCCACCCT CATCGACATC
AAGGTGCTGC CCAAGACCAT GATCCACAAG TACCTGTCCT GGTGGCGGGT CGGTGGCGCC
CGGGTCTCGA AGAGCGAGCG CATCGCGGCG GTCGCGCGGA TGCTCGAGGA CAACATCGCC
AAGGCCCGGC AGTACTGA
 
Protein sequence
MSTIRLTMAQ ALVKFLDNQY VSVDGVESKF VKGIFTIFGH GNVLGLGQAL EQDPGELIVH 
QGRNEQGMVH AAIGFAKQKM RRQIYACTSS VGPGAANMIT AAATATANRI PVLLLPGDVY
ATRQPDPVLQ QIEQSHDLSI STNDAFRAVS KYWDRVSRPE QLMSAAINAM RVLTDPAETG
AVTLALPQDV QGEAYDYPDY FFAKRVHRID RRPATAAMLA DAVALLKGKR KPLLICGGGV
KYSGAAEALQ RFAERFEIPF AETQAGKSAI VSAHPLNVGG IGETGCLAAN LLAREADLVI
GVGTRYTDFT TASKWIFQNP EVAFLNLNVS AFDAYKLDAV QVVADARAGL EALGEALGHG
GYRAQWGEAT AQAKARLKAE VDRVYAVEYS GEGFVPEIDD HLPRSVLEEF IELTGSSLTQ
SQVLGVLNRT LADDAIIVGA SGSLPGDLQR IWRCKGTDTY HMEYGYSCMG YEVNAALGVK
MAEPEREVYT LVGDGSYMML HSELPTSIQE RRKINIVLLD NMTFGCINNL QMEHGMDSFG
TEFRYRNPET GKLDGGFVPV DFAMSAAAYG CKTYRVKTLD ELHAALEDAR RQSVSTLIDI
KVLPKTMIHK YLSWWRVGGA RVSKSERIAA VARMLEDNIA KARQY