Gene Avin_50040 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_50040 
SymboliolC 
ID7763855 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp5068941 
End bp5070884 
Gene Length1944 bp 
Protein Length647 aa 
Translation table11 
GC content68% 
IMG OID643807835 
Productmyo-inositol catabolism protein IolC 
Protein accessionYP_002802069 
Protein GI226946996 
COG category[S] Function unknown 
COG ID[COG3892] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGGCGAAA TCACCTTCGC AAGCGGACGT CAGTTGGACG TCATCTGTCT GGGGCGCCTC 
GGCGTCGACC TGTACGCCCA GCAGATCGGC GCACGGCTCG AGGACGTCGG CAGCTTCGCC
AAATACCTGG GCGGCTCGTC CGCCAACATC GCATTCGGTA CCGCCCGCCT GGGTCTCAAG
TCAGCCATGC TGACCCGCGT GGGCGACGAC CACATGGGCC GCTTCCTGAT CGAGGCGCTG
GAGCGCGAGG GCTGCGACAC CCGGGCGATC AAGGTCGACC CGGAACGCCT GACCGCGATG
GTCCTGCTGG GCATCAAGGA CCGCGATACC TTCCCGCTGA TCTTCTACCG CGAGAACTGC
GCCGACATGG CGCTGCGCGA GGAGGACATC GACGAAGCCT TCATCGCCTC CAGCAAGGCG
CTGTTGATCA CCGGCACCCA CTTCTCCACC GAAAGGGTCT ACAAGGCCAG CAGCAAGGCG
CTGGACTACG CCGAAAAGCA CAACGTCAAG CGCGTGCTGG ACATCGATTA CCGGCCGGTG
CTCTGGGGCC TGACCGGCAA GGCCGACGGC GAGACCCGCT TCATCGCCAG CGCCGAGGTC
AGCGCGCACG TGCAGCGCAT CCTGCCGCGC TTCGACCTGG TGGTCGGCAC CGAGGAGGAA
TTCCTCATCG CCGGCGGTTC CGAGGATCTG CTCAGCGCCC TGCGCAAGGT GCGCGAGGTG
ACCGCCGCGA CCCTGGTGGT CAAGCTCGGT CCGCTGGGCT GCACGGTCAT CCACGGCGCC
ATTCCGGCGC GCCTGGAGGA CGGCAACATC TATAAAGGCA TCCGTGTCGA GGTGATGAAC
GTGCTGGGCG CCGGCGACGC CTTCATGTCC GGCTTCCTGC GCGGCTGGCT GACCGGCGGT
GACGACGAGC GCTGCAGCCG TCTGGCCAAC GCCTGCGGCG GCCTGGTGGT ATCGCGCCAC
GCCTGCGCCC CGGCGATGCC GACCCCGGCC GAACTCGACT ACATCCTCAA CAGCCCGGTA
CCCATCACCC GCCCGGACCT CGACCCGCAC CTGAACCGCC TGCACCGGGT CAGCGTACCG
CGCAAGAACT GGAAGCCGCT GTTCATCTTC GCCTTCGACC ATCGCGGTCA ACTGGTGGAA
CTGGCCCAGC AGGCCGGACG CGACCTCGCG GCGATTCCCG AACTCAAGCA ACTGTTCATC
ACTGCCATCG AACGGGTCGA GGCCGATCTC CAGCGCCAGG GCATCGAAGG CGACGTGGGT
CTGCTGGCCG ACCAGCGCTT CGGCCAGGAC GCCCTCAACA GCGCCACCGG CCGCGGCTGG
TGGATCGGCC GCCCGGTCGA GCTGCAAGGC TCGCGGCCGC TGGCCTTCGA GCATGGCCGC
TCGATCGGCA GCAATCTGGT GCAGTGGCCG CGCGAGCACA TCATCAAGTG CCTGGTGCAG
TTCCACCCGG ACGACGAGCC CCTGCTGCGC CTGGAACAGG AGGGCCAGCT CAAGGGCCTC
TACGAGGCGG CCCAGGCCAG CGGCCACGAA CTGCTGCTCG AGGTGATCCC ACCGAAGAAC
CATCCCTCCA CGCATCCGGA CGTGCTCTAC CGGGCGATCA AGCGGCTCTA CAACATCGGC
ATCCATCCGG ACTGGTGGAA GATCGAGCCG CAGCCGGCCG AGGTGTACTC GAAACTCGAT
GCGCTGATCA CCGAACGCGA TCCCTACTGC CACGGCGTGG TCCTGCTCGG CCTCAATGCG
CCGGCCGAGG AACTCGCCGA AGGCTTCCGC CAGGCCGCCG GCAGCCAGGT CTGCCGCGGC
TTCGCGGTCG GCCGGACGAT CTTCCAGGAA CCCAGCCGCG CCTGGCTGGC CGGCGAGATC
GACGACGAGA CCCTGATCGC GCGGGTGCGG GCCACCTTCG AGTTCCTGAT CAAGTCCTGG
CGCGAGGCGC GCGGCCAGGT CTGA
 
Protein sequence
MGEITFASGR QLDVICLGRL GVDLYAQQIG ARLEDVGSFA KYLGGSSANI AFGTARLGLK 
SAMLTRVGDD HMGRFLIEAL EREGCDTRAI KVDPERLTAM VLLGIKDRDT FPLIFYRENC
ADMALREEDI DEAFIASSKA LLITGTHFST ERVYKASSKA LDYAEKHNVK RVLDIDYRPV
LWGLTGKADG ETRFIASAEV SAHVQRILPR FDLVVGTEEE FLIAGGSEDL LSALRKVREV
TAATLVVKLG PLGCTVIHGA IPARLEDGNI YKGIRVEVMN VLGAGDAFMS GFLRGWLTGG
DDERCSRLAN ACGGLVVSRH ACAPAMPTPA ELDYILNSPV PITRPDLDPH LNRLHRVSVP
RKNWKPLFIF AFDHRGQLVE LAQQAGRDLA AIPELKQLFI TAIERVEADL QRQGIEGDVG
LLADQRFGQD ALNSATGRGW WIGRPVELQG SRPLAFEHGR SIGSNLVQWP REHIIKCLVQ
FHPDDEPLLR LEQEGQLKGL YEAAQASGHE LLLEVIPPKN HPSTHPDVLY RAIKRLYNIG
IHPDWWKIEP QPAEVYSKLD ALITERDPYC HGVVLLGLNA PAEELAEGFR QAAGSQVCRG
FAVGRTIFQE PSRAWLAGEI DDETLIARVR ATFEFLIKSW REARGQV