Gene Avin_44600 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_44600 
SymbolthiC 
ID7763331 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp4510561 
End bp4512447 
Gene Length1887 bp 
Protein Length628 aa 
Translation table11 
GC content66% 
IMG OID643807312 
Productthiamine biosynthesis protein ThiC 
Protein accessionYP_002801553 
Protein GI226946480 
COG category[H] Coenzyme transport and metabolism 
COG ID[COG0422] Thiamine biosynthesis protein ThiC 
TIGRFAM ID[TIGR00190] thiamine biosynthesis protein ThiC 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.168262 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAACGTGA CAGAACAACA GAAGAACCTG AGCGAGTCCG CCCAGGTCGA CCAGCAGTCC 
GTACAGCCCT TCCCGCGTTC GCGGAAGATC TACGTCACCG GCTCGCGCCC GGACATCCGC
GTGCCGATGC GCGAGATCGG CCTGGACGTG ACACCCACCG CGTTCGGCGG CGAGATCAAC
CCGCCGGTCA CCGTCTACGA CACCTCCGGC CCCTACACCG ACCCGAACGT CGTCATCGAC
GTGCGCAAGG GGCTGGCCGA CGTGCGCAGC GCCTGGATCG AGGATCGCGG CGACACCGAG
AAGCTGCCGG GCCTGACCTC GGAGTTCGGC CAGCGCCGCC TCTCCGACCC GGAACTGGCC
GCCATGCGCT TCGCCCACGT ACGCAACCCG CGCCGCGCCA AGGCCGGGCA CAACGTCACG
CAGATGCACT ATGCCAGGAA AGGCATCGTC ACCCCGGAGA TGGAATACGT CGCCATCCGC
GAGAACATGA AGCTCGCCGA GGCGCGCGAG GCCGGCCTGC TCGCCGCGCA GCATCCGGGG
CAGAGCTTCG GCGCCAGCAT TCCGAAGGAA ATCACCCCCG AGTTCGTCCG AGCCGAAGTC
GCCCGCGGCC GCGCCATCAT CCCGGCCAAC ATCAACCACG TGGAGCTGGA GCCGATGATC
ATCGGCCGCA ACTTCCTGGT GAAGATCAAC GGCAATATCG GCAACTCGGC GCTGGGATCG
AGCATCGAAG AGGAAGTGGC CAAGCTGACC TGGGGCATCC GCTGGGGCTC GGACACCGTG
ATGGACCTGT CCACCGGCAA GCACATCCAC GAGACCCGCG AGTGGATCAT CCGCAACTCG
CCGGTGCCGA TCGGCACGGT GCCGATCTAC CAGGCGCTGG AAAAGGTCGG CGGCATCGCC
GAGGACCTGA CCTGGGAGCT GTTCCGCGAC ACCCTGATCG AGCAGGCCGA GCAGGGCGTG
GACTACTTCA CCATTCATGC CGGCGTGCTG CTGCGCTACG TGCCGCTGAC CGCGAAAAGG
GTCACCGGCA TCGTCTCCCG CGGCGGTTCG ATCATGGCCA AATGGTGTCT GGCGCATCAC
CAGGAAAACT TCCTCTACAC CCACTTCGAA GACATCTGCG ACATCATGAA GGCCTACGAT
GTCAGCTTCT CCCTGGGCGA CGGCCTGCGC CCCGGCTCGG TGGCCGATGC CAACGACGCC
GCCCAGTTCG GCGAACTGGA AACCCTCGGC GAACTGACCA GGGTCGCCTG GAAGCACGAG
GTGCAGACCA TCATCGAAGG CCCCGGCCAC GTGCCGATGC ACATGATCAA GGAGAACATG
GACAAGCAGC TCGAATGCTG CGACGAGGCG CCGTTCTACA CCCTCGGCCC GTTGGTCACC
GACATCGCCC CCGGTTACGA CCACATCACC TCCGGCATCG GCGCGGCGAT GATCGGCTGG
TTCGGCTGCG CCATGCTCTG CTACGTGACG CCCAAGGAGC ACCTCGGCCT GCCGAACAAG
GACGATGTGA AGACCGGCAT CATCACCTAC AAGATCGCCG CGCACGCCGC CGATCTGGCC
AAGGGCCATC CGGGCGCGCA GATCCGCGAC AACGCGCTGA GCAAGGCGCG CTTCGAGTTC
CGCTGGGAGG ACCAGTTCAA CCTCGGCCTC GATCCGGACA CCGCGCGTGC CTTCCACGAC
GAGACCCTGC CCAAGGACTC GGCCAAGGTG GCGCACTTCT GCTCCATGTG CGGGCCGAAG
TTCTGTTCGA TGAAGATCAC CCAGGAAGTG CGCGACTACG CCGCCGAGCA CGGCCTGTCC
GAGGAAAGCC AGGCCGTCGA GGCCGGTTTC CGGGAGCAGG CCGAGCGCTT CCGCGAAGAG
GGCTCGGTGA TCTACAAGCA GGTCTGA
 
Protein sequence
MNVTEQQKNL SESAQVDQQS VQPFPRSRKI YVTGSRPDIR VPMREIGLDV TPTAFGGEIN 
PPVTVYDTSG PYTDPNVVID VRKGLADVRS AWIEDRGDTE KLPGLTSEFG QRRLSDPELA
AMRFAHVRNP RRAKAGHNVT QMHYARKGIV TPEMEYVAIR ENMKLAEARE AGLLAAQHPG
QSFGASIPKE ITPEFVRAEV ARGRAIIPAN INHVELEPMI IGRNFLVKIN GNIGNSALGS
SIEEEVAKLT WGIRWGSDTV MDLSTGKHIH ETREWIIRNS PVPIGTVPIY QALEKVGGIA
EDLTWELFRD TLIEQAEQGV DYFTIHAGVL LRYVPLTAKR VTGIVSRGGS IMAKWCLAHH
QENFLYTHFE DICDIMKAYD VSFSLGDGLR PGSVADANDA AQFGELETLG ELTRVAWKHE
VQTIIEGPGH VPMHMIKENM DKQLECCDEA PFYTLGPLVT DIAPGYDHIT SGIGAAMIGW
FGCAMLCYVT PKEHLGLPNK DDVKTGIITY KIAAHAADLA KGHPGAQIRD NALSKARFEF
RWEDQFNLGL DPDTARAFHD ETLPKDSAKV AHFCSMCGPK FCSMKITQEV RDYAAEHGLS
EESQAVEAGF REQAERFREE GSVIYKQV