Gene Noc_0943 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0943 
Symbol 
ID3707334 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1041349 
End bp1043334 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content51% 
IMG OID637737452 
Productcytochrome c-type biogenesis protein CcmF 
Protein accessionYP_342985 
Protein GI77164460 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG1138] Cytochrome c biogenesis factor 
TIGRFAM ID[TIGR00353] c-type cytochrome biogenesis protein CcmF 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value0.190679 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACTCCAG AACTTGGACA ACTCGCACTG ATTTTAGCCC TCACCCTAGC CTTCTCCCAG 
GCAATATTGC CTCTTATTGG CGCTCAGCGT GGAATCATAG GCTGGATGAA TGTAGCACGA
ACTGCCGCTT ACGGTCAGTG TTTCTTCCTA ATCGTTGCCT TCACCTGCCT GGCTATCAGC
TTTCTCAATA ATGATTTTTC CGTAGCCTAT GTAGCCAATA ATTCTAATTC CGCGCTACCA
CCAGCCTATC GTTTTGCCGC CATCTGGGGG TCCCACGAAG GATCACTATT ACTGTGGAGC
CTAACTCTAG GCCTGTGGAC AGTAGCTGTG GCCTTATTTA GCCGTAGCAT ACCATTAGCC
TATGTGGCCC GTGTGCTAGC CGTCATGGGC ATGGTTAATG TGGGCTTTCT GTTATTTATG
CTCCTAACCT CTAATCCATT TTTACGTCTT TTCCCCGCCC CCTTAGACGG ACAAGATCTC
AATCCTTTAT TGCAAGACCC CGGCCTAGTG ATGCACCCCC CCATGCTCTA TATGGGTTAT
GTGGGCTTTT CCGTGGCCTT TAGCTTTGCC ATTGCGGCGC TTATCGGCAA ACGCCTAGAT
GCAGCCTGGG CGCGGTGGTC TAGACCCTGG ACGGTAGTTG CGTGGTTATT TTTATCTATT
GGGATTACCT TAGGGAGCTG GTGGGCCTAC TACGAACTCG GCTGGGGCGG CTGGTGGTTC
TGGGATCCGG TTGAAAATGC TTCCTTTATG CCGTGGCTTG TGGGTACCGC GCTAATTCAT
TCCCTTGCCG TGACTGAAAA ACGGGATGCC TTCAAGCGCT GGACAGTACT GCTCGCCATT
TTTACTTTCT CCCTCAGCCT GCTTGGAACC TTTATAGTGC GCTCCGGGGT ACTCACCTCC
GTGCATGCCT TTGCAACGGA TCCTACGCGG GGTGTTTTTA TCCTGATATT GCTAGGACTT
GCCGTAGGCG GATCACTCCT TCTCTATGCC GTGCGGGTGC CGCAAGTACA GGATACAGGC
CGTTTTCATC TTCTTTCCCG CGAAGGGCTT CTATTGGCGA ACAATGTCGT ACTCATAGTC
GCTGCAGGCA GTGTGTTGTT AGGTACTCTT TATCCACTGG TTCTCGAAGG ATTAGGGTTA
GGTAAAATCT CGGTAGGACC ACCCTATTTT GATACCGTAT TTGTGCCTTT AATGGTCATC
CTAGCTTTCT TGCTAGGAGC TGGCCCCCTA TCCCGCTGGA AGCAGCAAAG CTTAGGGGAA
CTCGTCAAAA AATTAGGTTT TATCTTTGCC ATTAGCTTAA CAATTGGAAT TCTAGTTCCC
TATATCGTTG ATAAAGGAGA TATTTTCAGC GCAGCCGCCG GGCTGACCGT CGCTATCTGG
GTTGCCCTAA CAACTCTTCT TGGCCTCTGG AATCGACTCA GAAACCGGGG GGGAATTGTA
GCAGGAGCAC GTTCCTTATC GCGGAGCTTT ATAGGCATGT CCTTGGCCCA TACCGGCTTT
GCAGTCGCTA TTGTCGGCGT CACTTTAACC ACTATCTATG GACAGGACCG GGATGTGGGC
GTTACCGTTG GCGAGACAGC GGAATTAGGC GCCTATGAGT TCCGCCTAGA TAAAATTCAT
GAGGTTGATG GTCCCAACTA TCGGGCTATT GAGGGAACTA TATCGGTCTT CAAGGGTGAA
GATTTGATCA CGACCCTCCA CCCCCAAAAA CGCATTTACC TAGTACAAAC TTCTCCCATG
ACGGAAGCGG GTATTGATGC AGGGTTATTC CGGGATCTAT TCGTTGCCTT GGGAGAATCT
CTTGGCAGTA ATACTTGGAG TCTACGCATT CAATTTAAGC CTTTTGTACG CTGGATTTGG
CTGGGTGGAT TATTGATGGC CCTAGGCGGC GCAGTTGCCG CCAGTGATCG ACGTTACCGG
GTGAGCGTAC AAAAAATGCG CTTCAAGCTT AACCCTCCCA AACCTGAGGC TGCGGCTGAA
ATCTAA
 
Protein sequence
MTPELGQLAL ILALTLAFSQ AILPLIGAQR GIIGWMNVAR TAAYGQCFFL IVAFTCLAIS 
FLNNDFSVAY VANNSNSALP PAYRFAAIWG SHEGSLLLWS LTLGLWTVAV ALFSRSIPLA
YVARVLAVMG MVNVGFLLFM LLTSNPFLRL FPAPLDGQDL NPLLQDPGLV MHPPMLYMGY
VGFSVAFSFA IAALIGKRLD AAWARWSRPW TVVAWLFLSI GITLGSWWAY YELGWGGWWF
WDPVENASFM PWLVGTALIH SLAVTEKRDA FKRWTVLLAI FTFSLSLLGT FIVRSGVLTS
VHAFATDPTR GVFILILLGL AVGGSLLLYA VRVPQVQDTG RFHLLSREGL LLANNVVLIV
AAGSVLLGTL YPLVLEGLGL GKISVGPPYF DTVFVPLMVI LAFLLGAGPL SRWKQQSLGE
LVKKLGFIFA ISLTIGILVP YIVDKGDIFS AAAGLTVAIW VALTTLLGLW NRLRNRGGIV
AGARSLSRSF IGMSLAHTGF AVAIVGVTLT TIYGQDRDVG VTVGETAELG AYEFRLDKIH
EVDGPNYRAI EGTISVFKGE DLITTLHPQK RIYLVQTSPM TEAGIDAGLF RDLFVALGES
LGSNTWSLRI QFKPFVRWIW LGGLLMALGG AVAASDRRYR VSVQKMRFKL NPPKPEAAAE
I