Gene Noc_0371 details


Gene Information

Locus tag: Noc_0371
Symbol:
ID: 3706542
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 404477
End bp: 408265
Gene Length: 3789 bp
Protein Length: 1262 aa
Translation table: 11
GC content: 54%
IMG OID: 637736883
Product: hypothetical protein
Protein accession: YP_342427
Protein GI: 77163902
COG category: [S] Function unknown
COG ID: [COG2911] Uncharacterized protein conserved in bacteria
TIGRFAM ID:
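
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that cross-check. It assumes that Start bp and End bp are 1-based, inclusive positions on the replicon and that the coding sequence ends in a single untranslated stop codon. The record does not state either assumption, but both are consistent with the listed values. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 404477
end_bp = 408265

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codon_count = gene_length_bp // 3
protein_length_aa = codon_count - 1

print(gene_length_bp)     # 3789
print(protein_length_aa)  # 1262
```

Both results agree with the Gene Length and Protein Length fields.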


Plasmid Coverage information

Num covering plasmid clones: 16
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
GTGAAGCTAC TGCGTTCGAT TGGCGCTTTT ATTCTGCTGT TGTTAGTGCT GCTGGTAGGG 
GGGCTTGCCT ATTTGCTCTT GACTGAGGTG GGAACTCGGC AATTATTGGC CCAGGTGGCC
CAGGTGATTC CGGGAGAGTT AGAAACCCAA CAGGTTGAGG GGACTTTGGG GGAGGCGCTT
ACCCTGACCG GATTGCGCTA CCGGACTCCG GATTTTACTC TTGAAGTCGG GTATTTTCAC
TTTGCCTGGC GGCCGGCAGC CTTGTTGGGG GCAACTTTCT GGGTGGAGCA GCTTCATCTT
GGAGAAGTCA GCTGGCGCCA AAAACGCCCA GGGGAGTCTA CGGCCTCGCA AGAGCCGATA
GTGCTTCCCG AAATACAGAT CCCTTTAAAA GCTAAGGCAG AAGACGTCCG GCTGCAAAAT
ATTTCCTTAA CTCCCCTTGG ATCGTCCCCT GTGGTTATTA ATACTATTGT CTTCAAGGGG
AATTTCGATG GCCAAGCCTT ACAAGTAGGC GAATTGGGGG TTTCTGCTCC CCAAGGAGAG
GTGCAGGTGA GTGGAGAGAT GGCTTTCCAG GAGGCTTATC CCATGGCGTT CGCCCTGGCT
TGGGGGGCAC CGGTTCCCGA GTTGGGGAAG GTCACTGGCA TTGGTAGGGT GAAAGGAGAT
CTTCGCCAGT TGACTCTACA CCAAACCGTT CAAGCTCCCT TCCATCTGCA ATTCCGAAGC
AGGCTTTTTG AGCTATTGGA ACAGCCTCGC TGGGTTGCGG CGCTGGAGGT TCCCGGGGTC
GAATTGCAGC ATTTATCTGC CCAGTGGCCA GCAGTACGTT TTGGGTTAGA CCTTAAGGGC
GCAGGGAGCT TGGAACAGTT TAAGGTACGA GCCTCCTATC AGATTCAGGA GGCTCAAACA
GGCGAAGTGA GGGGCAGTCT CTCGGTGGAA CAGCTTGCCA TGGGGCATTG GCTCCTGGAC
CGGCTGACCT TGCGGCAAGT TGAGGGGCCT GCTCGGTTAG CGCTCCGGGG GGAGGTGATG
ATGGCAGAAG ACCAGCCTCG TATGAGCCTT GCGGGCCAGT GGCAAGATAT GGCTTGGCCC
CTGAGAGGGA CGGCTCAAGT TTCCAGCAAC CGGGGCCAAT TGACCTTGGA AGGAACGCCA
GCCGCCTACC GGTTACAATT GAACAGCGCT CTTGCCGGTC AGGATATTCC CACCAGCGAG
TGGCATTTGA TGGGTACCGG CGATACCACC CAGTTTGAGT TAGAGAAGTT ACGGGGACAA
TTGTTGGATG GGGTTTTGAG TGGTTCGGGC AATTTCCGCT GGACTCCCGC CTTGGCCTGG
GATGTGCGAG TAGACGGAGA AGCGCTTAAT CTGTCGAAGG AATGGCCAGA ATGGCCAGGA
GTATTGTCCT TTAGCAGTGA TACTAACGGT GTTTTAGAGG AGGACGCCCA GGATATCACG
CTTGATCTCC ACGCCCTTTC TGGCTCCCTA CGGGGCTATC CAGTAGCGGC CCAGGGAAGA
GTTCAACGAC AAGATAATAC CTGGCGTATT GCCGATTTAA AGTTACGCTC GGGAGATTCC
CGGCTTTCTT TGGGGGGAAC GGTCAATGAG CGGCTAGCGC TAAAGTGGCA TCTCTCTTCC
CCGGATCTTT CCCAACTGCT GCCGGAGGCT CAAGGGGATT TGTTAGTAAA GGGGCATGCG
AAAGGGCCGC TTACAGGCCC GGAACTTACC TTTCGACTCC AGGGGAAAGC TCTGGCCTAT
CAGGATTATC AAGTGGAATC GGTGATGGCT AATGTGGATG TGGACTTACA GGGGAAGCAG
TCATCGCAGG TGCGGATCGA TGCCTCGGAT TTCACTTTGG CAGACCAGAC TCTCCGCTCT
GTGGCTATTG AGGGGGGCGG CACGCCATTG CACCATAAAC TGAGTCTGGC AGTGAAGGCG
CCGGAGCGTT CCCTGGACCT TGGATTCCAA GGCTCCTGGA AAGAGGAGGT TTGGCAGGGA
GAAATTACCA AGACCGAGCT TACGGACTCG CTCATGGGTC ACTGGGAGGC AGTGAGCGCC
ACTTCCCTTA CTCTGAGTCG CAGCAATATT GATCTCGCTC CTTGGTGCTG GCGGCAGCAA
TCTGCCCAGC TCTGCCTAGG TGGCAGTTGG CAGGAAGAAA GTTTTTGGCG AGGAAGTTTC
AAGCTAGAGG ATTTCCCGTT AGCAATGCTA GGGCCGCTTT TACCGGAAAA AACAGCACTG
GAAGGGGTGA TTGGGGGCGA GGTGCAGGCA CAGGGAGAGG CTCACCAGTT AGTCCAGGCC
CGGATGCAGC TTGCGGCTTC CGGGGTTCAA TTAACCCAAG TGACACCTGA GGGGCAGTCT
CTGCGCTTTC CCTATCAAGA CATGCAGGCC AGGCTTAATT TGGAAGATAG GGGAGGGAAA
GCAGGTTTTG AGCTACTTTC AGCCGATCCT GGCACAGCGC CAGTTAGGGC CTCTCTGCGC
CTGCCTTCTG CTCCTTTGGA TCTAACTGCC TTGGGGCAAT TGCCTTTAGA TGGCCAGATC
TCAATGGCTT TTAGGGATCT CGCTTTTTTG GAAACGCTGA TACCAGAATT GGAAGCGGTT
CAGGGACAAT TGCGGGCAGA TCTGACCTTG GGAGGACAGG TTGCCGCACC TCAATTACTG
GGAGAAGTTG TGCTTCAAGA AGGAAGCGCC CAGGTCGTTC CCCTGGGGTT AAAGTTGATA
AAGATCCGGT TGCGGGCAGA GGCAAGCGAG CAAGATAGGA TCGTTTTCAC GGGCGGGGTA
CACTCGGGGG AGGGAGAGTT AGCTGTCAAT GGCCAGGTTC GTCTGGAACC TGAGGCGGGT
TGGCCCGCTA AGGTGACGGT GACCGGAGAA CGTTTTGAAG CCATGGGGAC CTCGGATATC
AGGGTATTGA TCTCACCTCA GTTGCAGATC ACGAAGGCAG AAGAAGCGAT TCGCGTGGAA
GGGGAAGTTG TAATACCAGA GGCCACTTTA GTGATCAAGG ATATTGAGAG CAGAGGGGGG
GTGCCAGTTT CCCAAGACGT AGTGATCATA TCCCAGGAAA AGGAAACTGA AAAAAAGGCT
GTGCCCATTT ATGCCCGAGT CAGAATTATT TTGGGCGATG ATATTTCAGT GCGGGCCTTT
GGTTTTAAAG GGGGAATAAC CGGGAGCCTA TTAGTGACGG AAACTCCCGG AAAGGCTACA
CGGGGAAGCG GTGAACTCCA GATCGTTAAG GGTGAATATA AAGCCTATGG ACAGCAGTTA
AATATTCGGC AGGGTCAGGT GGTTTTTGCC GGACCTATTG ATGATCCCCG GCTGAGCGTA
GAAGCAGTAC GTGAGGTTGA TAATGGTAAT ATAGTCGTTG GAGCGCGCAT CCGGGGGGCT
GCCAGTGAAC CAGTGCTCAC TTTATTTTCT GAGCCGTCGA TGGATGAGAG CAATATCCTG
GCCTATTTGA TCCTAGGGCG GCCTTTGGCG GGAGCTTCTG GGGGTGACGG CGAATTATTG
ACTAAAGCTG CGACTTCTCT TGGCTTGTCC GGTGGCACCC TCCTTGCTAA ACGGCTTGGA
AAAATCTTTG GTTTGGAAGA TGTGGGGATT GAATCCGCTG ATAACGGTAA TGGGAATGGG
GATACCCAAA GTGAGATGTT GATGCTGGGC AAGCAGCTCT CACCCAGTCT TTACATTGGT
TATGGAATCG GGTTATTTGA GCGTTTTAGC TCTTTTCGAA TGCGCTATAT TTTGAGCAAA
AATTGGAGCG TACAAGCCGA AACGGGCCTT GAAACCGGCG CCGATTTATT TTATAGCCTA
GAGCGGTGA
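
The sequence above is printed in groups of 10 bases, so it needs to be joined before any calculation such as the GC content reported in the Gene Information section. The sketch below shows one way to do that. The helper names clean_sequence and gc_fraction are hypothetical, and the sample is only the first line of the gene sequence, so its GC value is not expected to match the whole-gene figure.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a sequence printed in 10-base groups."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used here only as a short sample.
sample = "GTGAAGCTAC TGCGTTCGAT TGGCGCTTTT ATTCTGCTGT TGTTAGTGCT GCTGGTAGGG"
seq = clean_sequence(sample)
print(len(seq), f"{100 * gc_fraction(seq):.0f}%")
```

Passing the full gene sequence to the same two functions gives the whole-gene value to compare against the GC content field.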
 
Protein sequence
MKLLRSIGAF ILLLLVLLVG GLAYLLLTEV GTRQLLAQVA QVIPGELETQ QVEGTLGEAL 
TLTGLRYRTP DFTLEVGYFH FAWRPAALLG ATFWVEQLHL GEVSWRQKRP GESTASQEPI
VLPEIQIPLK AKAEDVRLQN ISLTPLGSSP VVINTIVFKG NFDGQALQVG ELGVSAPQGE
VQVSGEMAFQ EAYPMAFALA WGAPVPELGK VTGIGRVKGD LRQLTLHQTV QAPFHLQFRS
RLFELLEQPR WVAALEVPGV ELQHLSAQWP AVRFGLDLKG AGSLEQFKVR ASYQIQEAQT
GEVRGSLSVE QLAMGHWLLD RLTLRQVEGP ARLALRGEVM MAEDQPRMSL AGQWQDMAWP
LRGTAQVSSN RGQLTLEGTP AAYRLQLNSA LAGQDIPTSE WHLMGTGDTT QFELEKLRGQ
LLDGVLSGSG NFRWTPALAW DVRVDGEALN LSKEWPEWPG VLSFSSDTNG VLEEDAQDIT
LDLHALSGSL RGYPVAAQGR VQRQDNTWRI ADLKLRSGDS RLSLGGTVNE RLALKWHLSS
PDLSQLLPEA QGDLLVKGHA KGPLTGPELT FRLQGKALAY QDYQVESVMA NVDVDLQGKQ
SSQVRIDASD FTLADQTLRS VAIEGGGTPL HHKLSLAVKA PERSLDLGFQ GSWKEEVWQG
EITKTELTDS LMGHWEAVSA TSLTLSRSNI DLAPWCWRQQ SAQLCLGGSW QEESFWRGSF
KLEDFPLAML GPLLPEKTAL EGVIGGEVQA QGEAHQLVQA RMQLAASGVQ LTQVTPEGQS
LRFPYQDMQA RLNLEDRGGK AGFELLSADP GTAPVRASLR LPSAPLDLTA LGQLPLDGQI
SMAFRDLAFL ETLIPELEAV QGQLRADLTL GGQVAAPQLL GEVVLQEGSA QVVPLGLKLI
KIRLRAEASE QDRIVFTGGV HSGEGELAVN GQVRLEPEAG WPAKVTVTGE RFEAMGTSDI
RVLISPQLQI TKAEEAIRVE GEVVIPEATL VIKDIESRGG VPVSQDVVII SQEKETEKKA
VPIYARVRII LGDDISVRAF GFKGGITGSL LVTETPGKAT RGSGELQIVK GEYKAYGQQL
NIRQGQVVFA GPIDDPRLSV EAVREVDNGN IVVGARIRGA ASEPVLTLFS EPSMDESNIL
AYLILGRPLA GASGGDGELL TKAATSLGLS GGTLLAKRLG KIFGLEDVGI ESADNGNGNG
DTQSEMLMLG KQLSPSLYIG YGIGLFERFS SFRMRYILSK NWSVQAETGL ETGADLFYSL
ER
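
The protein sequence can be reproduced from the gene sequence with translation table 11, as listed in the Gene Information section. The sketch below is illustrative and not a reference implementation. Table 11 shares its amino-acid assignments with the standard genetic code, and an alternative start codon such as the GTG that opens this gene is read as methionine. The START_CODONS set is a deliberately small subset chosen for this example, and the function name translate is hypothetical.

```python
from itertools import product

# Codons in TCAG order, paired with the standard amino-acid assignments
# (translation table 11 uses the same assignments as the standard code).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: a small subset of bacterial start codons, enough for this gene.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(cds: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # the initiator codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence from this record.
first_line = "GTGAAGCTAC TGCGTTCGAT TGGCGCTTTT ATTCTGCTGT TGTTAGTGCT GCTGGTAGGG"
print(translate(first_line))  # MKLLRSIGAFILLLLVLLVG
```

The output matches the first 20 residues of the protein sequence above. Without the start-codon rule, the leading GTG would be read as valine (V) rather than M.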