Gene Nmul_A1204 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNmul_A1204 
Symbol 
ID3786135 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosospira multiformis ATCC 25196 
KingdomBacteria 
Replicon accessionNC_007614 
Strand
Start bp1390187 
End bp1391257 
Gene Length1071 bp 
Protein Length356 aa 
Translation table11 
GC content52% 
IMG OID637811289 
Productcytochrome oxidase assembly 
Protein accessionYP_411899 
Protein GI82702333 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1612] Uncharacterized protein required for cytochrome oxidase assembly 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCAGTTTG TCCAAACTTC AACACTTCCG CAAAAAACCG ATCAAAAGTT TGCTGTGCAG 
AAACCGATTG CTATCTGGTT GTTCGTTTGC TGCGCTCTGG TATTCGCCAT GGTGGTTGTA
GGCGGTGTAA CCCGTCTTAC CGATTCAGGG CTTTCCATCG TCGAATGGCA ACCCCTCGTT
GGCACGGTTC CTCCACTCAG CCAGAATGAC TGGGATGAGC TCTTCGAGAA ATATCACCAG
ACGCCTCAGT ATAAAAAGGT AAATCTTGGC ATGAGCCTGG AAGAGTTCAA GACAATCTTC
TGGTGGGAAT ATTTCCACCG CTTATTGGGG CGCGTCATCG GGTTGGCATT TTTCATACCA
TTCCTGTATT TTCTGATGAA AAAGGCAGTC GACCGGCCAC TGGGACTGAA GTTGTCAGGA
ATTTTCCTGC TGGGGGCTTT GCAGGGTGGG ATGGGATGGT ACATGGTAAA GAGCGGGTTG
GTGGATAACC CCCACGTCAG CCAGTATCGT CTGACTGCGC ACTTGGGTCT CGCTTTCGCG
ATTTATGCCG CAATGTTCTG GGTAGCCCTC GATCTGCTCA ATCCCGGCCG CGGTTTGTCC
GCAAACAGCG GACTGCGTGG TTTGCTCAAT TTCTCCACCA TGCTGTCTGC CCTGGTATTC
ATAATGGTTT TATCGGGCGG GTTCGTGGCA GGCATTCGGG CAGGTCTGGC TTACAATACT
TTTCCACTCA TGGATGGCCA CTTCATCCCC CCGGAACTAT TCATGCTGGA ACCCTGGTAC
CGGAATTTCT TCGACAATAT GACCACTGTG CAATTCGACC ATCGCCTGAT TGCATGGACA
CTGGCAATTC TCGTTCCGAT TTTCTGGCTC AAATCGAGAG CAGTGCCACT TTCAGGCTCG
GCTCGTCTTG CATGCACTCT ACTGTTGATC ATGCTGGCAG TGCAGATCAC TCTGGGGATT
TCCACGCTGC TGCTGGTTGT TCCTCTAACC CTCGCGGCAG CACATCAGGC AGGCGCACTA
CTGTTGTTTA CCGCTGCCCT TTGGGTGAAT CATGAGCTAC GGCGCCAATA G
 
Protein sequence
MQFVQTSTLP QKTDQKFAVQ KPIAIWLFVC CALVFAMVVV GGVTRLTDSG LSIVEWQPLV 
GTVPPLSQND WDELFEKYHQ TPQYKKVNLG MSLEEFKTIF WWEYFHRLLG RVIGLAFFIP
FLYFLMKKAV DRPLGLKLSG IFLLGALQGG MGWYMVKSGL VDNPHVSQYR LTAHLGLAFA
IYAAMFWVAL DLLNPGRGLS ANSGLRGLLN FSTMLSALVF IMVLSGGFVA GIRAGLAYNT
FPLMDGHFIP PELFMLEPWY RNFFDNMTTV QFDHRLIAWT LAILVPIFWL KSRAVPLSGS
ARLACTLLLI MLAVQITLGI STLLLVVPLT LAAAHQAGAL LLFTAALWVN HELRRQ