Gene Noc_0056 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_0056 
Symbol 
ID3705932 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp54071 
End bp57892 
Gene Length3822 bp 
Protein Length1273 aa 
Translation table11 
GC content59% 
IMG OID637736581 
Producthypothetical protein 
Protein accessionYP_342128 
Protein GI77163603 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTTAGGT TACAGGATAG GAGATTTGGA CATATCAAAA ACATACCTGT AGAGGAGAAT 
CCAGGGAGGG CGCAAATCAA ATACGGAGAC CTTATCCAAT TCGACCCAAT AGAGTCGGTC
GTTCAGTTGC GTGACGCGGA CAAGTCGAGC GCCGCGCACA CCCTCGTGAA TACCTATGTC
ATTTCCAAGG AAATGGCCGA ACGGCTCACG CAGCTTGTCA TTCCTCAGAT GCAGTTCGAC
CAACCGGTCG ATAACAAGGG TCTGCTGGTC GTCGGTAACT ATGGCACCGG TAAGTCGCAC
TTGATGTCGG TGGTCTCCAG CCTTGCCGCA GACGCCTCAC TGCTGGAGGA GCTGAACCAC
GCCAGTGTAC GCGACGCCGC CGCTCAGATC GCTGGACGTT TCAAGGTGAT ACGTACCGAG
ATTGGAGCCA CCACCATGTC CCTGCGCGAC ATCCTGGTGG CCGAACTGGA AGAGCATTTC
GAAAAACTCG GCGTGGAGTA TGTGTTCCCT GAAGCCGGGA CTATCATCAG CCACAAACGG
GCCTTCGAAG ACATGATGGC CAAGTTCGGC GAGGTCTTCC CCGAACACGG CCTGCTGCTG
GTGGTCGACG AGCTGCTCGA CTACCTACGC ACCCGCAAGG ACCAGGAGCT GATTCTCGAC
CTTAACTTCC TCCGCGAGGT CGGCGAGGTC TGCAAGGATC TGCGTTTCCG CTTCATGGCC
GGTGTCCAGG AAGCTATTTT CGACAGCCCG CGCTTCGCCT TTGTGGCCGA CAGCATTCGC
CGGGTAAAGG ACCGCTTCGA GCAGATCCTT ATTGCCCGCA GCGACGTGAA ATTCGTCGTG
GCGGAGCGCC TGCTCAAGAA AACCGCCGAG CAACAGGCCA AGATCCGCGA CTACCTGATG
CCCTTTGCCA AATACTACGA CGGGCTCAAC GAGCGCATGG ACGAGTTTGT TCGGCTCTTC
CCGGTACATC CTGATTACAT CGACACCTTC GAGCGCGTTA CCGTGGTGGA AAAACGTGAG
GTGCTTAAGA CCCTGTCTAT GGGCATGAAA GGCATTCTCG GCAAGGATGT ACCGCAGGAC
GAACCTGGTC TGATCGCCTT CGACAGTTAC TGGAACACGC TCAAGCAGAA TGCCTCCTTC
CGCGCCATTC CCGAGATCCG GGCGGTCATC GATTGCAGCC AAGTGCTGGA GTCTCGCATC
GAGAACGCAC TGACCCCGAA AAACTACAAG CCCATGGCAG TGCGCCTAAT ACACGCCCTG
TCCGTTCACC GTCTCACCAC TGGCGACATC TATGCCCCTA TGGGCGCATC CGCCGAAGAA
TTGCGCGATC GGCTCTGCTT ATTTGAACCG TTAATCACGG AAATGGGGAG TGATGAGCCT
GACAAGGATC TACAGACCCA TGTAGAGACT GTACTGCGCA AAATCCACAC AGCTGTCAGT
GGACAGTTCA TTTCTGAAAA CAAGGACAAC CGCCAGTTTT ATCTCGATCT GAAGAAGACC
GATGATTTCG ACGCCCTGAT CGACAAACGG GCCGAGAGTC TGGGCCAGGC TCAGCTCGAC
CGTTTCTATT ACGAGGCACT CAAGCGTGCC ATGGAATACC AGGACGCCAC CTACGTGACC
GGCTACAAGA TCTGGCAGCA CGAATTGGTC TGGCAGGAGC ACAAGGCCGC CCGTACCGGC
TACCTCTTTT TCGGCGCTCC CAACGAGCGC TCCACCGCCG TGCCGCAGCG GGACTTCTAC
CTGTATTTCA TTCAGCCCAA CGATCCGCCG CGCTTCAAGG ACGGCAAGGT GAGTGACGAG
GTTTTTTTCC GCCTGAAGAG CACCGACGAG GAATTCCAAA CCGCGCTGAA GAGCTATGCG
GCAGCCCTGG ACCTTGCGGG CACATCCTCG GGCCATGCCA AGGCCACCTA TGAATCCAAA
GCCAACGGCT TCCTGAAGAA GCTGGTTCAG TGGCTGCAGA AGCACATGAG CGATGCCTTC
GAGGTCACCT ATCAGGGCCG CGCCAAGTCC ATGACCGAAT GGGCCAAGGG CAAGTCCATC
CGCGACTTGT CCGGTCTGCT ACCCCACGAG ACCATCAACT TCCGTGACCT GGTCAACACC
ATCGCCGGTG TCTGCCTGGC ACCGAACTTC GAGAACCAGG CTCCGGACTA TCCGTTCTTC
TCGGTCCTGA TTACCGGCAA TAACCGCGCC CAGGCCGCGC AGGATGCCTT GCGGACCATC
GCCGGACAGA ACCGCACCAA GCAGGCCACC GCCGTGCTGG ACGCCCTGGA TCTACTTGAC
GGCGAGAAAA TAGACCCCTA CAAGTCGAAG TACACCAAGT TTATCCTCGA TGCGGTCAAG
GCCAAGGGGC ACGGCCAGGT AGTCAACCGC AGCGAGATCA TCCAGGATGA CCAGGGGCTG
GAATACATGA ACCCGGGCGG TTCGCGTCTG GAGCCGGAAT GGGTGAGCGT CCTGGTGGCG
GCGCTGGTCT ACTCCGGCGA CATCGTGCTC GCCATCCCGG GCAAGAAATT CGATGCCACC
GGCCTGCAGC AACTTGCCGC GACCGGCATG GACGAGCTGG CCCGCTTTAA GCACCTGGAG
CAGCCGAAGG AATGGAACTT GCCAGCGCTC AAGGCGCTAT TTGAGCTGCT CGGCATGACG
CCCGGCATGG CCCAGCTCGT CACCCAGGGC AAGGACGAGC CGGTGCAGAA CCTGCAGCAG
GCGGTGGGCA AGATCGTCAA GCGCATCGTC ATGACCCAGC AAACCCTACG CGAAGGGCTT
TCCTTCTGGG GGCTAGACCT GCTTGTGGGC ACCGACCTGG CCAGCCAGAC CAGCGGACTG
GACGAGGCCA AGGGCTTCTT TGAATCGCTG CAGGCTTACT CCTCGCCGGG CAAGCTGAAA
AACTTCCGCT ACAGCGCTCC CGAAGTACTG GCCCACGAAA AGGCCATGAA GGCGCTGGAC
GAGCTGGATG CCCTGCGCGC GTTCATCATG GACAATGGCC CGACGGCGTC CTGGCTCTCC
ACCGCCGAGG CGGTGTTGCC TGCCGAGCAT GATTGGGTGG ATCGCATGAA GACCACCCGA
CAGGACGTAC TGGATGCCTT CAAGCAGGCT AATCTGACCG AGCTGACCAG CCAGTCCCAG
AGTCCCTTAT CCGGAATCGG CGCCAAGCTA CAGAAGCTGA AGAAGGATTA CACCGTCGCC
TACATCGGTC TGCACACCAA GGCCCGGCTG GGGGTGAACG AGGATAAGCG CAAGGCGGGA
CTGCTCAACG ACCCGCGTCT GCAAACCCTG CGTAAGCTGG CCGGTATCGA CCTGATGCCA
CGGCAGCAGC TCACCGATTA CCAGAACCGC CTGGCCGGGC TGAAGAGCTG CTTCGCCTTG
ACCGAGCAAA ACCTCGACGC CTCGTCCATC TGCCCGCATT GTGGCTTCCG GCCTGCGGTG
GAAACCGGTG CGGCCGTGGG GTCGCAGATG ATCGACCAGA TGGATACCCA GCTCGATGCC
ATGGTGGCGG CCTGGACCTC CACCATCCTC AGCAATCTGG AGGGCCCGAT CACCCAGGCC
AATATGGATC TACTGAAAAT CGACGACCGC GAACCGCTGG AAGCCTTCAT CAAGTCGAAG
GAACTGCCGG TGCCGCTGGA CAGCAACTTC GTTCACGCCC TGAAGGAAGT GCTCTCCGGT
CTGGTCAGGG TCACCGCCAA GGCACAGGAG CTGCAACAGG CCCTGCAGGT TACCGACGGC
CCGGCCACCC CGGTGGAGAT GAAGAAACGC TTTGAGGAGT ACATCGATCA ACTCACCAAG
GGCAAGGACC CGGCCAAGGT GCGGATCGTC ATGGAAGGTT AA
 
Protein sequence
MVRLQDRRFG HIKNIPVEEN PGRAQIKYGD LIQFDPIESV VQLRDADKSS AAHTLVNTYV 
ISKEMAERLT QLVIPQMQFD QPVDNKGLLV VGNYGTGKSH LMSVVSSLAA DASLLEELNH
ASVRDAAAQI AGRFKVIRTE IGATTMSLRD ILVAELEEHF EKLGVEYVFP EAGTIISHKR
AFEDMMAKFG EVFPEHGLLL VVDELLDYLR TRKDQELILD LNFLREVGEV CKDLRFRFMA
GVQEAIFDSP RFAFVADSIR RVKDRFEQIL IARSDVKFVV AERLLKKTAE QQAKIRDYLM
PFAKYYDGLN ERMDEFVRLF PVHPDYIDTF ERVTVVEKRE VLKTLSMGMK GILGKDVPQD
EPGLIAFDSY WNTLKQNASF RAIPEIRAVI DCSQVLESRI ENALTPKNYK PMAVRLIHAL
SVHRLTTGDI YAPMGASAEE LRDRLCLFEP LITEMGSDEP DKDLQTHVET VLRKIHTAVS
GQFISENKDN RQFYLDLKKT DDFDALIDKR AESLGQAQLD RFYYEALKRA MEYQDATYVT
GYKIWQHELV WQEHKAARTG YLFFGAPNER STAVPQRDFY LYFIQPNDPP RFKDGKVSDE
VFFRLKSTDE EFQTALKSYA AALDLAGTSS GHAKATYESK ANGFLKKLVQ WLQKHMSDAF
EVTYQGRAKS MTEWAKGKSI RDLSGLLPHE TINFRDLVNT IAGVCLAPNF ENQAPDYPFF
SVLITGNNRA QAAQDALRTI AGQNRTKQAT AVLDALDLLD GEKIDPYKSK YTKFILDAVK
AKGHGQVVNR SEIIQDDQGL EYMNPGGSRL EPEWVSVLVA ALVYSGDIVL AIPGKKFDAT
GLQQLAATGM DELARFKHLE QPKEWNLPAL KALFELLGMT PGMAQLVTQG KDEPVQNLQQ
AVGKIVKRIV MTQQTLREGL SFWGLDLLVG TDLASQTSGL DEAKGFFESL QAYSSPGKLK
NFRYSAPEVL AHEKAMKALD ELDALRAFIM DNGPTASWLS TAEAVLPAEH DWVDRMKTTR
QDVLDAFKQA NLTELTSQSQ SPLSGIGAKL QKLKKDYTVA YIGLHTKARL GVNEDKRKAG
LLNDPRLQTL RKLAGIDLMP RQQLTDYQNR LAGLKSCFAL TEQNLDASSI CPHCGFRPAV
ETGAAVGSQM IDQMDTQLDA MVAAWTSTIL SNLEGPITQA NMDLLKIDDR EPLEAFIKSK
ELPVPLDSNF VHALKEVLSG LVRVTAKAQE LQQALQVTDG PATPVEMKKR FEEYIDQLTK
GKDPAKVRIV MEG