Gene Noc_0937 details


Gene Information

Locus tag: Noc_0937
Symbol:
ID: 3707328
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 1035942
End bp: 1037240
Gene Length: 1299 bp
Protein Length: 432 aa
Translation table: 11
GC content: 51%
IMG OID: 637737446
Product: hypothetical protein
Protein accession: YP_342979
Protein GI: 77164454
COG category: [C] Energy production and conversion
COG ID: [COG0247] Fe-S oxidoreductase
TIGRFAM ID:
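
The start, end, and length fields above are related by simple arithmetic. The following is a minimal Python sketch of that consistency check. It assumes the coordinates are 1-based and inclusive, which the record does not state explicitly, and the variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 1035942
end_bp = 1037240

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# The CDS ends with a stop codon, which is not translated,
# so the protein has one residue fewer than the number of codons.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1299 432
```

Under that assumption the computed values agree with the reported gene length (1299 bp) and protein length (432 aa).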


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTCTCAGC TTGGGCAGGA TGCCTGTTCA TCCTATAAAA CACAAGGAAT AGCGTTTCTG 
GAGAAAGGGA CGGATAGTTG TGTCAGGTGT GGATTGTGTT TGCCCCATTG CCCTACTTAC
CTCTTGACTG GCGATGAAAG TGAATCTCCC CGAGGCCGGA TTTCTCTCAT TAGGGCATTA
GCACAGAATC AGCTGGCGCC TACTCCGCCC TTGTTAGGCC ATTTGGAACG CTGCCTAACT
TGCCGCGCCT GCGAGTCAGT TTGCCCTTCT GATGTTCCCT ATGGGCGTTT AATTGATACC
GTCCGTGCTC AAGTCCGAGT CCAGCAGTCC CGTGGGCAAC GGGTTAAAAA ACATCTCCTC
CACCAGTGGC TAATCCCTCG TCCAACCTTA CTTCGTGGTC TTGGAGGATT GATGAGACTG
GCCCAGCGCA TAGGGGTTGT AGGGTTAGCT CGGCGCAGCG GGTTGTTGGA AGCTTTAGGA
TGGGCTTCGC TAGAGGCTTT GCTGCCTGAA TTACCTCCTC GGCAAGGATG GCAGCCCTTT
TTTCCTGCTC AGGGCAAGGA GCGGGGCCAA GTCGCTTTAT TTACAGGTTG CGTTACTGAT
GTGGTGGATC AACCCTCATT AATGGCAACG GTGCAATTGC TAAACCAGGT AGGTTATGGG
GTCCATATTC CTAGGAAGCA GGTTTGCTGT GGTGCTTTGG CTAGACACGA CGGTGAATGG
GAACGAGCCT TGGCACTAGC GGTTGAAAAC ATCACAGCAT TTGCTACTGC GGAAGTCGAA
GCGATTCTCT GCACTGCCAG TGGTTGCACA ACGAGTCTGG TAGACTATCC TCAGTGGCTG
CAAGAGGTGG GAATGGAGGC TGTCGCTGCT CGCGATTTTG CTGGTAAACT TTGGGATGTT
AACCAATTTC TACTTCAAAG GGCTTGGCCG GCAAGCGTAA CCCTGAAGCC TCTGGCAAAA
CGGATTGCGG TACAAGATCC TTGCAGCTTG CGCCATGTTT TGCACCAGCA CGAGGCGGTA
TATACTTTAT TGCGCCGAAT TCCAGAGGCT GATATTTTAC CGCTCCCAAG CAACGGGCAA
TGCTGTGGCG CCGCTGGAAG CTATATGCTA ACCCAGCCAG AATTCGCCCA ATTTCTACGT
GCCGAGAAAA TAAACGCCTT ACAGAAAATT CAACCTGATA TTTTGGTGAC CTCTAATATT
GGGTGTGCGC TGTATTTGGC TGCGGGGATA AAAGAGGCGG CTCTTCCTAT TGAAATATTG
CATCCTGTGC AATTATTAGC TCGGCAACTT AGTTTTTAG
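
The record reports a GC content of 51% but does not say how that figure was derived. The sketch below assumes it is the percentage of G and C bases over the full gene sequence. The function name `gc_content` is hypothetical, and the input string is a toy example rather than the gene itself.

```python
def gc_content(seq: str) -> float:
    """Return the percentage of G and C bases, ignoring whitespace."""
    # The sequence is printed in space-separated blocks of 10 bases,
    # so strip all whitespace before counting.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return 100.0 * gc / len(bases)

print(gc_content("ATGC GGCC"))  # 75.0
```

Passing the gene sequence above as a single string would give the value to compare against the reported, presumably rounded, figure.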
 
Protein sequence
MSQLGQDACS SYKTQGIAFL EKGTDSCVRC GLCLPHCPTY LLTGDESESP RGRISLIRAL 
AQNQLAPTPP LLGHLERCLT CRACESVCPS DVPYGRLIDT VRAQVRVQQS RGQRVKKHLL
HQWLIPRPTL LRGLGGLMRL AQRIGVVGLA RRSGLLEALG WASLEALLPE LPPRQGWQPF
FPAQGKERGQ VALFTGCVTD VVDQPSLMAT VQLLNQVGYG VHIPRKQVCC GALARHDGEW
ERALALAVEN ITAFATAEVE AILCTASGCT TSLVDYPQWL QEVGMEAVAA RDFAGKLWDV
NQFLLQRAWP ASVTLKPLAK RIAVQDPCSL RHVLHQHEAV YTLLRRIPEA DILPLPSNGQ
CCGAAGSYML TQPEFAQFLR AEKINALQKI QPDILVTSNI GCALYLAAGI KEAALPIEIL
HPVQLLARQL SF
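
The protein sequence corresponds to the gene sequence translated with translation table 11. The sketch below rebuilds the codon table from the standard NCBI ordering of bases (TCAG); table 11 assigns the same amino acids to codons as the standard table and differs only in its permitted start codons. The `translate` helper is illustrative and is not the database's own tool. Because this gene begins with ATG, no special handling of alternative start codons is included.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI codon order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)

# First line of the gene sequence from the record.
first_line = "ATGTCTCAGC TTGGGCAGGA TGCCTGTTCA TCCTATAAAA CACAAGGAAT AGCGTTTCTG"
print(translate(first_line))  # MSQLGQDACSSYKTQGIAFL
```

The output matches the first 20 residues of the protein sequence listed above.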