Gene Avin_10230 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_10230 
Symbol 
ID7759968 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp971161 
End bp972555 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content63% 
IMG OID643803928 
ProductDi-heme cytochrome c peroxidase, CCP_MauG family 
Protein accessionYP_002798230 
Protein GI226943157 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG1858] Cytochrome c peroxidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGAAAAAGC TACCCCTCGT CATTGGAGCC TGCGTCGCCG GCTACTTGGC AACCGTCTTC 
ATCGTAGACC GTTTCGACGT GCGTCTCAGC GAACAGCATC TGGCCAGTGC CCAACTGAAC
GGCATGGACA AGCTCACGAG CGAAGCCTTC AAGGTGCTGA ACAGCAATGG TTGCCAGTAT
TGCCATACCC GCAACAGCGA GCTGCCGTTC TACGCCAACA TGCCGATCGC CAAGCAACTG
ATGAACAAGG ACATCGAGCT GGCCCAGCGC CAGTTCAACA TCGAGTCGCT GCTGGCCAGC
GCGCAACAGG GCAAGGCGGT CTCGGAAGTG GACCTGGCCA AGATCGAGTC GGTGATGCAG
GACAACGCCA TGCCGCCGAA CCTCTACCTG GGCATGCACT GGCGGTCCCG GCTGTCCGAC
GAGGAGAAGG GCGTGCTGCT CGACTGGGTG AAGGCCGAGC GCCTGAAGCA GAGTTCGGCC
GATGCGGTCG CCGACGCCTA CAAGTACGAG CCGGTGCAGC CGATCACCAC CAGCTTTCCG
GTGAACCCGG CCAAGGTCGC GCTGGGCGAG AAGCTCTACC ACGATACCCG CCTGTCCAGC
GACGACACCG TCTCCTGCGC CAGTTGCCAT GCCCTGGACA AGGGCGGGGT GGACCGCCTG
GATGTTTCCG TCGGGGTCGG CGGCTCGAAG GGGCCGATCA ACGCACCGAC GGTGTACAAC
GCCGCCTTCA ACGTCCTGCA GTTCTGGGAC GGCCGCGCGG CCGACCTGCA GAAGCAGGCC
GGCGGCCCGC CGATGAACCC GCTGGAGATG GCTTCCACCT CCTGGGAGCA GATCGTCGGC
AAGCTGACGC AGGACGCCCA GTTGAGCGCC GAGTTCGCCG CCCTCTATCC GGAAGGCATC
ACCGAGAACA GCATCACCGA CGCCATCGCC GAGTTCGAGA AGACCCTGGT CACGCCGAAC
AGCCGCTTCG ACCTGTTCCT GAAAGGGCAG GGCGACGCTC TCAGCAGCGT GGAGAAGGAA
GGCTACGAAC TGTTCAAGAC CGCCAAGTGC GCGACCTGCC ACGTGGGCGA GGCGATGGGC
GGCCAGTCCT TCGAACTGAT GGGCATCAAG AAGGATTATT TCGCCGACCG CGGCAATGTC
AGCGAAGTGG ACCACGGGCG TTACAACGTG ACCAAGGACC CGCACGACAT GTACCGCTTC
AAGGTGCCGA CCCTGCGCAA CGTCGCGCTG ACCGCGCCCT ATTTCCACGA CGCCAGCGCC
AAGACCCTGG AGGACGCGGT CGACAAGATG GCGGAGTACC AGGTCGGCAT GAAACTGTCG
AAGGACGAGA TCGGCAAGAT CGTCGCCTTC CTGCAGACCC TCAACGGCGA GTACCAGGGC
AAGACCCTGC AGTGA
 
Protein sequence
MKKLPLVIGA CVAGYLATVF IVDRFDVRLS EQHLASAQLN GMDKLTSEAF KVLNSNGCQY 
CHTRNSELPF YANMPIAKQL MNKDIELAQR QFNIESLLAS AQQGKAVSEV DLAKIESVMQ
DNAMPPNLYL GMHWRSRLSD EEKGVLLDWV KAERLKQSSA DAVADAYKYE PVQPITTSFP
VNPAKVALGE KLYHDTRLSS DDTVSCASCH ALDKGGVDRL DVSVGVGGSK GPINAPTVYN
AAFNVLQFWD GRAADLQKQA GGPPMNPLEM ASTSWEQIVG KLTQDAQLSA EFAALYPEGI
TENSITDAIA EFEKTLVTPN SRFDLFLKGQ GDALSSVEKE GYELFKTAKC ATCHVGEAMG
GQSFELMGIK KDYFADRGNV SEVDHGRYNV TKDPHDMYRF KVPTLRNVAL TAPYFHDASA
KTLEDAVDKM AEYQVGMKLS KDEIGKIVAF LQTLNGEYQG KTLQ