Gene Avin_30370 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAvin_30370 
SymbolcycH 
ID7761937 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameAzotobacter vinelandii DJ 
KingdomBacteria 
Replicon accessionNC_012560 
Strand
Start bp3148536 
End bp3149753 
Gene Length1218 bp 
Protein Length405 aa 
Translation table11 
GC content73% 
IMG OID643805909 
Productcytochrome c biogenesis protein 
Protein accessionYP_002800177 
Protein GI226945104 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG4235] Cytochrome c biogenesis factor 
TIGRFAM ID[TIGR03142] cytochrome c-type biogenesis protein CcmI 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCGATT TCTGGATCGC CGCCGGCCTG CTGCTGCTGC TGGCCCTGGC CTTTCTGCTG 
CTCCCCGTGC TGCGCACTCG TCGCGCCCAG GCCGAGGAGG ACCGTACCGC GCTCAATGTG
GCGCTCTACG AAGAGCGTCT GGCCGAGCTG GACGCCCAAC GCGCGGCCGG CACCCTGAGC
GCCGGGCAAC TGGAGGCCGG CCGCGCCGAG GCGGCCCGCG AGCTGCTGGC CGACACCGAA
GCCGGCGAGG CGCCGCGCCG CTCCTCCTTG GGCAGGGCGG TGCCGCTGGC GGTGGCGCTG
CTGGTGCCGC TGCTCGGCTA CGGCCTCTAC CTGCACTGGG GCGCCAGCGA CAAACTCGAA
CTGGCCCGCC GGTTCGCCGA ACAGCCGAAG AGCATCGAAG AGATGACCGC GCGCCTGGAG
CAGGCGGTCA AGGTCCAGCC GGATTCCGCC GAGGGCTGGT ATTTCCTGGG GCGCACCTAC
ATGGCCGAGG AGCGTCCGGC CGACGCCGTA GCGGCCTTCG AGCAGGCCGC CCGGCTGGCC
CAGCGGCCGC CGGAGATCCT CGGCCAGTGG GCCCAGGCGC TGTATTTCGC CGAGGGCAAG
CGCTGGAGCC CGCGGATGCA GGCGCTGACC GACGAGGCGC TGGCCGGCGA GCCGGCCGAG
GTCACCAGCC TCGGCCTGCT CGGCATCGTC GCCTTCGAGG AGCGCCGCTT CGCCGATGCC
GCCGGTTACT GGGAGCGCCT GGTGGAAATC CTGCCCGAGG GCGATCCGTC GCGGGCGGCC
ATCGCCGGCG GCATCGCCCG GGCGCGCGAG CAGGCCGGCG CGTCGCAGGG CGCGGCGCCG
GCCGCCGCCC AGGTGGAGCT GAAGGTCAGC GTCGCCCTGG CGCCGGAGCT GGCCGGCAAG
GTGCGGCCGG ACGACAGCGT GTTCGTCTTC GCCCGCGCCG TTTCCGGTCC GCCGATGCCG
CTGGCGGTCG AGCGTCTGCG CGTGGCCGAT CTGCCGGCGC AGGTCGCCCT GAGCGATGCC
GATGCGATGA TGCCCCAGCT CAAGCTGTCC AACTTCGCCG AGGTGCAACT GGTGGCCCGG
ATCTCGCGGG CCGGCGATCC CACTGCGGGC GACTGGGTCG GCCAGCTCGA GCGGGTGAGC
GCCAGGGCAT CGGGCGAATA CGTCCTGACC ATCGATCGAG CCGACGCGCC CCGGGGGCGC
CCTGGAGAGG ACCGATGA
 
Protein sequence
MIDFWIAAGL LLLLALAFLL LPVLRTRRAQ AEEDRTALNV ALYEERLAEL DAQRAAGTLS 
AGQLEAGRAE AARELLADTE AGEAPRRSSL GRAVPLAVAL LVPLLGYGLY LHWGASDKLE
LARRFAEQPK SIEEMTARLE QAVKVQPDSA EGWYFLGRTY MAEERPADAV AAFEQAARLA
QRPPEILGQW AQALYFAEGK RWSPRMQALT DEALAGEPAE VTSLGLLGIV AFEERRFADA
AGYWERLVEI LPEGDPSRAA IAGGIARARE QAGASQGAAP AAAQVELKVS VALAPELAGK
VRPDDSVFVF ARAVSGPPMP LAVERLRVAD LPAQVALSDA DAMMPQLKLS NFAEVQLVAR
ISRAGDPTAG DWVGQLERVS ARASGEYVLT IDRADAPRGR PGEDR