Gene Avin_33090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_33090 
Symbol 
ID7762205 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3382538 
End bp3383599 
Gene Length1062 bp 
Protein Length353 aa 
Translation table11 
GC content74% 
IMG OID643806175 
ProductCobalamin biosynthesis CobC protein 
Protein accessionYP_002800439 
Protein GI226945366 
COG category[E] Amino acid transport and metabolism 
COG ID[COG0079] Histidinol-phosphate/aromatic aminotransferase and cobyric acid decarboxylase 
TIGRFAM ID[TIGR01140] L-threonine-O-3-phosphate decarboxylase 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.0570805 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCTTGAGC ACGGCGGCCG TCTGCGCGTC GCGGCGCGGT ATTACGGCAT TCCCCAGGGC 
GACTGGCTCG ACCTGTCCAC CGGCATCGCC CCCGAGCCCT GGCCCGTGCC GTCCATGTCG
TCCGATGCCT GGGCGCGCCT GCCGGAAGAG GACGACGGCC TGGCCGAGGC CGCCTGCGCC
TGCTACGGCG CCGCCCAGGC CTTGCCGGTG GCTGGCAGCC AAGCGGCGAT CCAGGCGCTG
CCGGCGCTGT TCGCGGCGGG AAGGGTTGGC GTGCTGGCGC CGAGCTACGC CGAGCACGCC
CAGGCCTGGC GACGCGCCGG TCACCGGCTG CTCCATCTGG CCGCCGGAGA CATCGAGGCG
CGCCTCGACG AGCTCGACAT GCTGGTGCTG GCCAACCCCA ACAACCCCAC CGGCGAGCGC
TTCGAGCCGT CCCGGCTGCT CGACTGGCAG GCGCGGCTGG CCCGGCACGG CGGCTGCCTG
CTGGTCGACG AGGCGTTCAT GGACTGCACG CCCGAATACA GCCTGGCGGC CCACAGCCAA
CGGCCGGGGC TGGTCGTGCT GCGCTCGTTC GGCAAGTTCT TCGGCCTGGC CGGCGTTCGC
CTGGGCTTCG TGCTGGCCGA GACGGGACTG CTGGCGCGGC TGCACGAGCG CCTCGGTCCC
TGGACGGTCA GCGGGCCGGC GCGGGCGCTC GGCCTGCAGG CGCTGGGGCC GGCCGGCGGC
GCGGCGCGCG AACGACGCGC CGGGCAATTG CGGGCGGCGG GAAAGCGGCT GGCGGCCTTG
CTGGACGCGC ACGGGTTGGC GCCGGCCGGC AGCACCGCGC TGTTCCAGTG GGTGCGGATG
CCCGATGCGG CGCGGTTGCA CGACTTTCTC GCCCGCCAGG GCATTCTGGT GCGCCTGTTT
GAGACGCCCG CCAGCCTGCG CTTCGGCCTG CCGGCGGACG AGCGCGGCTG GCAACGGCTG
GCGCAGGCGC TGGCCGACCC GGCACGACCC CGTGGGCGCG GACTCGTTCG CGAGCGGGCG
GGGACGATCG CCGACGAATC CGCAAGCAGG GAGTTTCCAT GA
 
Protein sequence
MLEHGGRLRV AARYYGIPQG DWLDLSTGIA PEPWPVPSMS SDAWARLPEE DDGLAEAACA 
CYGAAQALPV AGSQAAIQAL PALFAAGRVG VLAPSYAEHA QAWRRAGHRL LHLAAGDIEA
RLDELDMLVL ANPNNPTGER FEPSRLLDWQ ARLARHGGCL LVDEAFMDCT PEYSLAAHSQ
RPGLVVLRSF GKFFGLAGVR LGFVLAETGL LARLHERLGP WTVSGPARAL GLQALGPAGG
AARERRAGQL RAAGKRLAAL LDAHGLAPAG STALFQWVRM PDAARLHDFL ARQGILVRLF
ETPASLRFGL PADERGWQRL AQALADPARP RGRGLVRERA GTIADESASR EFP