Gene Noc_2136 details


Gene Information

Locus tag: Noc_2136
Symbol:
ID: 3705328
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 2462194
End bp: 2464104
Gene Length: 1911 bp
Protein Length: 636 aa
Translation table: 11
GC content: 37%
IMG OID: 637738612
Product: hypothetical protein
Protein accession: YP_344126
Protein GI: 77165601
COG category: [S] Function unknown
COG ID: [COG3011] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
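
The lengths above follow from the coordinates by simple arithmetic. The sketch below reproduces them, assuming the start and end positions are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the record.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids, assuming the CDS ends in exactly one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length(2462194, 2464104)
print(length)                  # 1911
print(protein_length(length))  # 636
```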


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value: 0.810683
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGTATCAAA AAGTATTCAG TCTCGTCAAA ATTGGACTTG AAAAACAGGT GCCGGCGTTA 
GGGCTTGGCG TTTTTCGTAT TTTCTTGGGT TTAGTTATTC TTCAAGAAAT TGTTTTTCTC
TATTATTTCC GCCATCTTAT TTTTGATACT ATTCCTTTTA TTGATGTCGC TTCGCCTTCC
ATTTATTTTT TCTTGATTTT ATGGGGCATT AATACCCTTT TTTTAACTAC GGGTTACCAT
ACGCGTCTCG CCGCTATTGT CAATTATTTT TTTTGGGTCA TCTTTACCGC CTTTACCCCT
ATGTGGCGAG ATTTTGATGG GGGTTTTGAT CAGTTTATGA TCGGATCGAG TTTTCTTCTT
ATTTTCTTGC CCACGGAAAG AGCATTCTCC TTAGACAATC TTAGAGTAAG ACTTAAATTT
CTGAAATCGG AATTGCACCA CGATCCTGTT AGTACCGTTT CTGTTCTTAG CTATTATCTT
CCTCTCGCTA TCTCTTTAGG ACTCATTTAC TTTGATTCGG CTGTTCATAA ATTATTTGCG
GAGCATTGGC GTAATGGATT AGGCGCATGG TTACCTTTGA CGATGCCTTA TTATATTTCC
GCGATCGATA TGACGTGGTT CCTGAATCAA GAATTCCTGC AAAAATTTAT TGGTTATTTG
ATTATAGTTT TTGAATTTAT TTTTATTTTT ACTTTTTATC TCCGGTCTTT TCGCGTGCCT
TTGATGATTA CGGGGATCTC TCTCCATAGC GGTATTATTT TATCCCTTAA TATTTATCCT
TTTGGCTTCG GAATGCTGGT TTATTACTTT TTGATGGTCC CTTTTTCATG GTGGCAGGGC
TTAAAGAAAA CATTACAGTT TAAGTCGCCA CAATTGGTTG TTTTCTATGA TCAACAATGC
CCGCTTTGCA ATCGCACCAG AATTATTATA GAGCATTTTG ATATTTTTAA AGCTATCAAT
TTTGAAGGAC TGCAAAAGCG TGCAAAAAAA TATCCTGAAC TGAATAACAT TTCTGAAGAA
CAATTGCTAA AAGACATTTA TGCTCTCGAT CAAAAAGGAC ATCTGTACGT AGGTATAGAC
GCCTACCTGC AAATTTTATT GAAAATGAAA TATCCTGCTC TTGCAGGAAT TTTTATAAGA
ATTCCAGGGG TTTATCATTT TGGGAAAAAA ATATATCGGC GAATTGCTGA TCAGCGTGCT
CGTCTTACCT GCGATGAGAG CTGCTTTGTT TCTTCAGAAA ATTCCCTACA GGAGGCATAT
AGCTTCAAAA GAAGCTACGA ATATTATGCT GGAACAAAAA AACAACGCTC TAATCGGATT
ACCAAGTTTT TAGTATTGAT CATGCTTTTG CAGTTAAATA GTACGATTCA CTATGGGATA
TTTTATCGCC TCAACGAGGA TGGAGCAGAA AGCGAGATCG GCCAGATTTT ATCGCCGATA
AGCAATGCAG TATTGTTTCT ATCTCATGCC TTTCTAGGAA TAACACCCCA TGCGCTTTAT
ATGCATGATC ATTTTCATGG TTATAACCAC ATTTTGGCGC TCACCTACAA AAACAGCCAG
GGACAGGAGC AATGGCTTCC ATTTGTGAAC GAGGAAGGAA GGTTAGTTGC GCCAAACTGG
GGAAGAGTTC AGTCTATGTG GGCAAATGTG GCAGTGACTC CCCATATAGA GCAAAGGCGT
CTTTATAAAT TTATTAAAAA AATGACTGCT TTTTGGGGTA AGAAGATAGA TCTGGATTTA
CAAGATACTG AATTTATCAT AAAAATGAAG AGAATAGATG TTCCTGTGCA TTGGGAAAGG
AATCTGCGTA ATAAGAATAT AAATCGGCCA TGGGTAAATA TTGGGAGAGT AATTTGGCAC
AAAGGCTTGG CAAGAATAGA GATACAGGAT ATTAATCTTG AGTCATTATA G
 
Protein sequence
MYQKVFSLVK IGLEKQVPAL GLGVFRIFLG LVILQEIVFL YYFRHLIFDT IPFIDVASPS 
IYFFLILWGI NTLFLTTGYH TRLAAIVNYF FWVIFTAFTP MWRDFDGGFD QFMIGSSFLL
IFLPTERAFS LDNLRVRLKF LKSELHHDPV STVSVLSYYL PLAISLGLIY FDSAVHKLFA
EHWRNGLGAW LPLTMPYYIS AIDMTWFLNQ EFLQKFIGYL IIVFEFIFIF TFYLRSFRVP
LMITGISLHS GIILSLNIYP FGFGMLVYYF LMVPFSWWQG LKKTLQFKSP QLVVFYDQQC
PLCNRTRIII EHFDIFKAIN FEGLQKRAKK YPELNNISEE QLLKDIYALD QKGHLYVGID
AYLQILLKMK YPALAGIFIR IPGVYHFGKK IYRRIADQRA RLTCDESCFV SSENSLQEAY
SFKRSYEYYA GTKKQRSNRI TKFLVLIMLL QLNSTIHYGI FYRLNEDGAE SEIGQILSPI
SNAVLFLSHA FLGITPHALY MHDHFHGYNH ILALTYKNSQ GQEQWLPFVN EEGRLVAPNW
GRVQSMWANV AVTPHIEQRR LYKFIKKMTA FWGKKIDLDL QDTEFIIKMK RIDVPVHWER
NLRNKNINRP WVNIGRVIWH KGLARIEIQD INLESL
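
The gene and protein sequences above are related by translation table 11, and the GC content is a simple base count. The following is a minimal sketch of both calculations, assuming the sequence is pasted in as a string in the space-grouped layout shown above. The helper names (`clean`, `gc_fraction`, `translate`) are hypothetical. Table 11 assigns the same amino acids to codons as the standard code and differs only in which codons may initiate translation, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Amino-acid string for NCBI translation table 11, with codons enumerated
# in the order TTT, TTC, TTA, TTG, TCT, ... (bases ordered T, C, A, G).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product("TCAG", repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean(seq)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence, as displayed in the record.
print(translate("ATGTATCAAA AAGTATTCAG TCTCGTCAAA"))  # MYQKVFSLVK
```

Passing the full gene sequence to `translate` should give the protein sequence listed above, and `gc_fraction` on the same input can be compared with the GC content reported in the gene information.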