Gene Avin_41490 details


Gene Information

Locus tag: Avin_41490
Symbol:
ID: 7763031
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Azotobacter vinelandii DJ
Kingdom: Bacteria
Replicon accession: NC_012560
Strand:
Start bp: 4183435
End bp: 4184997
Gene Length: 1563 bp
Protein Length: 520 aa
Translation table: 11
GC content: 68%
IMG OID: 643807005
Product: hypothetical protein
Protein accession: YP_002801256
Protein GI: 226946183
COG category: [H] Coenzyme transport and metabolism
COG ID: [COG3071] Uncharacterized enzyme of heme biosynthesis
TIGRFAM ID: [TIGR02996] repeat-companion domain TIGR02996
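
The coordinate and length fields above agree with each other under the usual inclusive-coordinate convention. The following is a minimal sketch of that arithmetic, assuming 1-based inclusive coordinates and a single terminal stop codon that is not counted in the protein length. The function names are illustrative and not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


length = gene_length_bp(4183435, 4184997)
print(length)                     # 1563
print(protein_length_aa(length))  # 520
```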


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.510625
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
CTGCTGGTCG GCGAACTGGC GGGACAGCGC AACCGCTTCG ACATCGCCCT GGAAAACTAT 
GCCGTCCAGG CCCAGGCCAC CCAGGACCCC GGCATCGCCG AACGGGCCTT TCGCATCGCC
GAATACCTCG GTGCCGACCA GAACGCGCTG GACAGCGCGC TGATCTGGGC GGACAACGCC
CCGGCCGACC TCGACGCGCA GCGCGCCGCC GCCGTCCAGC TGGCGCGCAA CGGCCGCTAC
GACGACTCCA TGCGCTACAT GGAGCGGGTC CTGCAGGCAC AGGGCGACAC CCATTTCGAC
TTCCTCGCCC TGGCCGCGGC GGAAAGCGAC CCGGAAACCC GCGCCGGACT GCTGCAGAGT
TTCGACCGGC TGCTCGCCAA GTCGCCGGAA AACGGCCAGT TGCTGTTCGG CAAGGCACTG
CTGCTGCAGC AGGACAACCA GCCCGAGGCC GCCCTCGAAC TGCTCGAGAA ACACCCGGCG
AGCGAGCACG AAATCGCCCC GCTGCTGCTG CGCGTGCGCC TGCTGCAGGG ACTGGGACGC
AGCGACGAGG CGAACGCCCT GCTGAAGAAG GGCATGCGCG AGTACCCCGA CGACAAGCGC
CTGCGCCTGA CCTACGCCCG CCTGCTGGTC GAGCAGGGCC GCCTCGACGA TGCCAAGGAC
GAGTTCGTCA CCCTGGTCCG GCAGTTTCCC GAGGACGACG ACCTGCGCCT GTCCCTGGCG
CTGGTCTGCC TGGAAGCCAA GGACTGGGAC GAGGCCGTGC TCTACCTGGA GGAACTGATC
GAACGCGGCA GCTACGTGGA TGCCGCGCAC TACCAGATGG GGCGCGTCTA CGAGGAACGC
GAGGACCCGG AAAGCGCACT GATCGAATAC GCCTTGGTCG GCCCCGGCAA CCATTACCTG
CCGGCGCAAC TGCGCCAGAC CGAGATCCTC TTCGAGCGCG GCCGCCCGGA AGAAGCCTCG
GCGCGTCTGG CGCAGGCCCG CGAAAACCAG CCGGACTACG CCCTGCAGCT CTATCTGGTC
GAGGCCGAGG CACTGACCAA CCGCGAACGG CTCGACGAGG CCTGGCAGGT CATCGAGCGC
GCCCTGCGCC ACTTCCCGGA CGACCTCAAC CTGCTCTACA CCCGCGCCAT GCTGGCCGAG
AAGCGCAACG ACCTGACCCA GCTGGAGCGC GACCTGCGCT TCATCATCGC GCGCGAACCG
GACAACGCCA TGGCCCTGAA CGCCCTCGGC TACACCCTGG CCGACCGCAC CACCCGCTAC
GCCGAGGCCA AGGCACTGAT CGAACAGGCC CACCAGCTCA ACCCGGACGA TCCGTCGATT
CTCGACAGCC TCGGCTGGGT GAACTACCGC CTGGGCCAGT TGGGCGAAGC CGAACGCCTG
CTGCGTCAGG CCGCCGAGCG CATTTCCGAC CACGAGATCG CCGCCCACCT GGGCGAAGTG
CTCTGGACCC GTGGCAAGCA GCGCGAGGCC CGCAAGGTCT GGGCCAAGGC GCTCGAGGAA
CAGCCCGACA GCGCCATTCT GCGCAGCACC CTGCTGCGCC TGACCGGATC CGAGACCTTC
TGA
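
The GC content reported for this gene can in principle be recomputed from the gene sequence. The sketch below assumes GC content is simply the share of G and C bases in the sequence, expressed as a percentage. The helper name is hypothetical, and the rounding used by the database is not known. The example call uses only the first ten bases of the sequence, so its result is not the gene-wide figure.

```python
def gc_content_percent(dna: str) -> float:
    """Percentage of G and C bases; whitespace between blocks is ignored."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in seq)
    return 100.0 * gc / len(seq)


# First ten bases of the gene sequence: 7 of 10 are G or C.
print(gc_content_percent("CTGCTGGTCG"))  # 70.0
```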
 
Protein sequence
MLVGELAGQR NRFDIALENY AVQAQATQDP GIAERAFRIA EYLGADQNAL DSALIWADNA 
PADLDAQRAA AVQLARNGRY DDSMRYMERV LQAQGDTHFD FLALAAAESD PETRAGLLQS
FDRLLAKSPE NGQLLFGKAL LLQQDNQPEA ALELLEKHPA SEHEIAPLLL RVRLLQGLGR
SDEANALLKK GMREYPDDKR LRLTYARLLV EQGRLDDAKD EFVTLVRQFP EDDDLRLSLA
LVCLEAKDWD EAVLYLEELI ERGSYVDAAH YQMGRVYEER EDPESALIEY ALVGPGNHYL
PAQLRQTEIL FERGRPEEAS ARLAQARENQ PDYALQLYLV EAEALTNRER LDEAWQVIER
ALRHFPDDLN LLYTRAMLAE KRNDLTQLER DLRFIIAREP DNAMALNALG YTLADRTTRY
AEAKALIEQA HQLNPDDPSI LDSLGWVNYR LGQLGEAERL LRQAAERISD HEIAAHLGEV
LWTRGKQREA RKVWAKALEE QPDSAILRST LLRLTGSETF
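
The gene sequence begins with CTG, yet the protein sequence begins with M. This is consistent with translation table 11, under which CTG is one of several alternative start codons and an initiating codon is translated as methionine. The following is a minimal sketch of that translation. It assumes the amino-acid assignments of NCBI table 11 (the same as the standard code) and the table 11 start-codon set as listed by NCBI. The names `translate_cds`, `CODON_TABLE` and `START_CODONS` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order (TTT, TTC, TTA, TTG, TCT, ...); '*' marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}
# Assumed start codons for table 11 (bacterial, archaeal and plant plastid code).
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a complete CDS; the start codon is always read as M."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = [seq[i:i + 3] for i in range(0, len(seq), 3)]
    if codons[0] not in START_CODONS:
        raise ValueError(f"{codons[0]} is not a start codon in this table")
    if CODON_TABLE[codons[-1]] != "*":
        raise ValueError("CDS does not end with a stop codon")
    body = "".join(CODON_TABLE[c] for c in codons[1:-1])
    if "*" in body:
        raise ValueError("internal stop codon")
    return "M" + body


# First 30 bases of the gene sequence, with a stop codon appended for
# illustration only (this is not the real gene).
print(translate_cds("CTGCTGGTCG GCGAACTGGC GGGACAGCGC TGA"))  # MLVGELAGQR
```

Passing the full gene sequence from this record to `translate_cds` would be expected to reproduce the protein sequence above if the assumptions hold.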