Gene Nmul_A1719 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1719 
Symbol 
ID3786196 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1961695 
End bp1963008 
Gene Length1314 bp 
Protein Length437 aa 
Translation table11 
GC content57% 
IMG OID637811806 
Productcytochrome c, class I 
Protein accessionYP_412409 
Protein GI82702843 
COG category[C] Energy production and conversion 
COG ID[COG2010] Cytochrome c, mono- and diheme variants 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.444977 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAAAATTG CTGTCATCGC TGCCGCAGTG GCAGTTCTTG CGGGTATCGG CCTATCTCCT 
GAAATAGTTG GCGCACCTTC TCCGAAAAAG GAAGCAAAGG CAGAAAAGAA TGCTACGCCG
TTATCATCGG TTCTTTTGCA ACAGCGGCGA GCGCGCGGAG CATATCTTGC ACGAGCCGGA
AATTGCATAG GATGCCATAC AGCTCAAGGT GGAGGCGCCT ATGCCGGCGG CAGGAAGCTT
TCGACTCCAT TTGGCACGTT CGTTACATCC AATATCACGC CGGATAAGGC AACCGGAATC
GGTGACTGGG ATGAAGACGA TTTCTGGAAA GCGCTGCACG AGGGTAAATC GCGCGATGGG
AGGCTTCTGT ATCCCGCGTT TCCGTATACC GAATATACGA AAGTCACGCG CGAGGATTCC
GATGCGATTT TTGCCCATCT TCAAGCCCTC GAACCCGTCG TTCAACAGAA TCCGCCAAGC
CAGGTTGCTT CCCGATACGA TTTCCAGCCA TTGCTGACTC TTTGGCGTGC CGCTTATTTC
AAGCCGGGCG TGTATCAGGC TGATCCCGCC AAAAGCACTG AGTGGAACCG GGGTGCTTAC
CTGGTGCAGG GGCTTGGTCA TTGCAGCGCC TGCCATGCCG AGCGGAATCC ACTGGGTGGC
ATGATTGGCC GCAAGGGAGA TGATAAGCTG GGAGGTGGGC AGATCATGGG CTCCAACTGG
TACGCGCCAT CGCTGACTTC GAGTCTGGAG GCAAGTACGG CTGGCTGGCC GGTCGAAGAA
ATTGTCCAAC TGCTGACCAC CGGGATTTCA CCCAGGGCGA CGACGTCAGG ACCGATGGCC
GAAGTTGTCA GTCAGAGCCT CCAGCATCTG ACAAAAGAGG ATGCCCGGGC AATGGCGATC
TACCTCAAAT CACTGCCTGA AACACAATCG CACCAACAGG TAAATTCTCC CGCACAGACG
GAACAGGTGC AGGCCTGGTT GCGGTACGGG GCACGGATAT ACAAGGAACA CTGCCAGGAT
TGCCATGGCG ACTCGGGGCA GGGCGCCCCG GGAATTTACC CCCCCTTGGC CGGTAACCGA
AGCGTTACCC TTACACCCCC TACGAATGTC ATTCGCAGCG TGCTCAATGG AGGTTATCCC
CCGTCCACTG CAGGCAATTC CCGCCCCTAC GGCATGCCCC CATTCGCACA GGTTTTACGC
GATGGCGAGG TTGCGCTGGT TCTGTCGTAT ATCCGCAACG CGTGGGGCAA TCGCGCCAGT
CTGGTGACAA CTGCTCAAGT CGACAAGAGT CGCGAAGGGA TAACGGAACG CTGA
 
Protein sequence
MKIAVIAAAV AVLAGIGLSP EIVGAPSPKK EAKAEKNATP LSSVLLQQRR ARGAYLARAG 
NCIGCHTAQG GGAYAGGRKL STPFGTFVTS NITPDKATGI GDWDEDDFWK ALHEGKSRDG
RLLYPAFPYT EYTKVTREDS DAIFAHLQAL EPVVQQNPPS QVASRYDFQP LLTLWRAAYF
KPGVYQADPA KSTEWNRGAY LVQGLGHCSA CHAERNPLGG MIGRKGDDKL GGGQIMGSNW
YAPSLTSSLE ASTAGWPVEE IVQLLTTGIS PRATTSGPMA EVVSQSLQHL TKEDARAMAI
YLKSLPETQS HQQVNSPAQT EQVQAWLRYG ARIYKEHCQD CHGDSGQGAP GIYPPLAGNR
SVTLTPPTNV IRSVLNGGYP PSTAGNSRPY GMPPFAQVLR DGEVALVLSY IRNAWGNRAS
LVTTAQVDKS REGITER