Gene Information |
Locus tag | Noc_1063 |
Symbol | |
ID | 3707246 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 1169176 |
End bp | 1172565 |
Gene Length | 3390 bp |
Protein Length | 1129 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637737568 |
Product | TPR repeat-containing protein |
Protein accession | YP_343101 |
Protein GI | 77164576 |
COG category | [R] General function prediction only |
COG ID | [COG0457] FOG: TPR repeat |
TIGRFAM ID | |
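The coordinate and length fields above can be cross-checked against each other. The sketch below is a minimal illustration, not part of the original record. It assumes that Start bp and End bp are 1-based and inclusive, and that Protein Length excludes the terminal stop codon; both assumptions are consistent with the values listed. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
start_bp = 1169176
end_bp = 1172565
gene_length_bp = 3390
protein_length_aa = 1129


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive positions."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Codon count of an unspliced CDS, minus one for the stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both checks pass for this record: 1172565 - 1169176 + 1 = 3390,
# and 3390 / 3 - 1 = 1129.
assert span_length(start_bp, end_bp) == gene_length_bp
assert expected_protein_length(gene_length_bp) == protein_length_aa
```

The same two checks apply to any unspliced, non-pseudo CDS record with this field layout; a spliced gene would need its exon lengths summed instead.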
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.194908 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCGCTTAA GATTTTCTGT GCTAGGGATA GCGCTTTTTG TTTTTGGGTT TTCCTGGGTA GGGGCGGATG AGCGTTGCGC CTCGCCGGTG GCGCAAATTG TTTCCCTCCA GGGGCGGGTG GAGGTCACTC CAGTGGACGA AAGGCGTTGG CGATCGGTGG GGTTGCGAGA GAAATTTTGC GCCGGAGACA GGATTCGCAT CGAGGCTTAT AGTCGTGCCT TGGTGCAGCT TCAAGATAAT ACTCTCCTGC ACCTGGATGG GGGTACCCTA GTGACTTTTT CAGGCATTGA ACCTAACAAG CCTTCCTGGT TTGAACTCCT GAAGGGCGCT ATCCACTTAA TCAGCCGTTT TCCCCATCGG CTGGAAGTCA AAACGCCCTT TGTGAATGCG GCGGTGGAAG GCACTGAATT TGCGATTCGG GTAGAACCGG AAAAAGCCTT GCTCTGGGTC TTCGAGGGGC GGGTCCTTTT TCACAATCCT ACTGGCCAGC TCACCGTCAC CAGCGGCGAG GCGGCGGTGG CCGAGGCCGG CCAAGCGCCG CGGCGGCGGC TGGTGATCCA ACCCCGGGAA GCAGTGCAAT GGGCGCTCTA TTATCCGCCG CTCATTGATC TGCGTCCCAG CGTTTATCCA AGCGGTCCAG AAGCCCAAGG AATTCACGTG GCGTTGCGGG CCTATCGGGA TGGGGATTTG CTCACCGCCC TGGGCCGCTT GGAACAGGTT CCGATTGGAG CGCGAGAGGC CAGCTATTTC ACTTTGCAGG CGGCCTTGCT CCTAGTGGTG GGGCGCATCG ATGAAGCCCG CCCCAACATT CAGCGTGCTT TGCAGCTCGA TCCCGACCAC GGAACCGCTT ATGCCCTGCA AGCCATCATC GCCCTGGCGC AGAACCGAAA AGAGGACGCC CTTCGGTTGG CCCGGCAGGG GGCTAAGCTC GATCCCCAGT CCTCCATTCC TCAAATCGCC CTGTCCTATG TTTATCAGGG AAGGTTTAAT ATCGAGCAAG CGTTACAACA TGCCCAGCAG GCTACCGAGC TTTTCCCTGG CGAGGCCCTT GCCTGGGCGA GGGTAGCGGA ATTACAACTC TCCCTGGGTG ATTTGGATGG AGCCGCCAAA GCCGCCCAAC AGGCGGTAGC CCTCGACCCG GATTTAGCTC GGACCCAAAC CGTGCGGGGC TTTGCCGAGT TGACGGCCAT TGATATTGAG GAAGCCAAAG CGAGTTTTCA GCGGGCCATT GAACTGGACC CCGCCGATCC CCTCTCCCGC CTTGGGTTGG GACTGGCCAA AATCCGCCAG GGGGATCTCA AGGCAGGCAC TGAGGAAATC GAAATTGCCG CCAGCCTGGA CCCCAATAAT TCGCTGATTA GAAGTTATCT CGGCAAAGCC TATTACGACC AGAAACGGGG AGAGGCGGCC GCAACGGAGC TGGCGATAGC CAAGGAACTC GATCCTAACG ATCCCACCCC CTGGTTCTAC GACGCCATTC GCAAGCAAAC GACGAATAGG CCGGTGGAAG CCTTGCATGA CATGCAAAAG GCCATTGAAC TGAATGATAA CCGGGCGGTG TACCGCTCGC GATTACTTAT GGACCAAGAT CTGGCCGCAA GAAGTGCGAG TCTCGGACGC ATTTACAACG ATTTGGGCTT TCAGCAACGG GGCCTATTGG AGGGATGGAA ATCGGTTAAC ACTGATCCCA GCAACTACTC CGCCCATCGG CTATTGGCGG ACAACTATGC GGCATTACCA AGGCATGAAA TTGCCAGGGT AAGTGAGTTA TTGCAATCCC AACTTTTGCA ACCGCTTAAC 
CTCACGCCCG TGCAGCCAAG CTTGGCAGAA AGTAATCTGC TTCTTTTAGA AGGTGCCGGT CCTTCAGGGC TTGCGTTTAA TGAGTTTAAC CCATTGTTCA CGCGCAATCG CCTAGCCTTG CAGGCATCCG GTGTTTTTGG CAGCAATGAT ACCTTGGGTG ATGAGGTGAC TCAGTCGGGG CTGTGGAAAA ATTTCTCTTA TAGTGTGGGA CAGTTTCATT CTGAAACGGA CGGATTTCGG GAAAATAGCG ACTTTGCGCG AAACACTTAT AATGTATTTA CTCAAGGGGC TTTATCTCCT AATACAAACT TGCAAGCGGA ATTTAGACAT GATGAGCGCA TACAAGGAGA TTTAGCTCTT AGATTTGATC CAAATTTTTC CAAAGTTCTT CGCGAAACCA GTCGTGTCAA CACATACAGA TTAGGAGCGC GGCATGCTTT TTCGCCTAAC TCGCAGATTA TAGCTTCATT AAGTTATCAA AACGTTAATG TTAAGCAAAA AACACAAACC CAAAGAACCA TTTCCATTCC TACCCCACTT GGCCCGTTAG AAACAGAAAT TCTGATTCCT ACAGAAGCTA CGATTAATAG AAATGGATTT ATAGGCGAAT TACAACATTT CTATACTAAT GAAAAAGCCA CGGTAATTTC TGGTTTCGGT CATATTAACA ATGATGTTAT TCAAAATGTC ACTTTTCCTG AGAATAAACC GCCTTTGACT GAAGTTATTA CCCACCCAGA TATAAGAAAA GTTAACATTT ATAATTATTC TCAGATCCGT GCTTTTGATA AGTTAACTGC AATACTGGGG TTAAGCATAG ATTCTTTGGA AATAAGAGGT CAACTCGATA AAACTCAAGT GAACCCAAAA TTCGGTCTAA TTTGGATGCT GCATTCTTCA ACGACTCTTC GGCTGGCAGG GTTTAGAAGC ATGAGTACCA CAAGAACTGC TAATCAGACT ATTGAGCAGA CACAGGTTGC TGGTTTTAAT CAATTTTTCG ATGATGTAAA TGGGACCGAT GCTTGGCGTT ACGGGGCTGC GGTTGATCAT GTTTTTTCTA AATATTTTTA TGGAGGAGTT GAGTATTCAG AACGAAAATT AGATGTTCCG GTACTTATCT CCCAAGGTTC AAAGGCGCAA TTCGTTAATT GGAAAGAAAA GACATCACGA ACCTATTTTT ATCTAACGCC AAATTCTAAT TTTGCGGCAA GTATCGAATA TTTTTTTGAG CGTTTTGACC GGCGCTCTAA TCCGTTACGG ACTGGAATTG TTGATGTGGC AACACATCGG GTTCCAGTAG GCTTAAGTTT TTTTCATCCT TTAGGTTTTT CGGCTAACCT CAAGGCCACC TATGTTAATC AATCCGGATT TTTCCAAAGA CGTAATTCGG ATGATATATT TAATGATCAG AGCGGGTTTA TCGTTGTTGA TATGAGCTTG AACTATAGAT TGCCGAAAAG ATTTGGAATT ATTAGAGTAG GCTCAAAAAA TTTATTTAAT GAGAGATTTA AATATCAAGA CATGGATCCA AATATGCCAT TGTTTTTTCC GGAGCGATTT TTGTACACAC AATTGACTTT GGCATTCTAA
|
Protein sequence | MRLRFSVLGI ALFVFGFSWV GADERCASPV AQIVSLQGRV EVTPVDERRW RSVGLREKFC AGDRIRIEAY SRALVQLQDN TLLHLDGGTL VTFSGIEPNK PSWFELLKGA IHLISRFPHR LEVKTPFVNA AVEGTEFAIR VEPEKALLWV FEGRVLFHNP TGQLTVTSGE AAVAEAGQAP RRRLVIQPRE AVQWALYYPP LIDLRPSVYP SGPEAQGIHV ALRAYRDGDL LTALGRLEQV PIGAREASYF TLQAALLLVV GRIDEARPNI QRALQLDPDH GTAYALQAII ALAQNRKEDA LRLARQGAKL DPQSSIPQIA LSYVYQGRFN IEQALQHAQQ ATELFPGEAL AWARVAELQL SLGDLDGAAK AAQQAVALDP DLARTQTVRG FAELTAIDIE EAKASFQRAI ELDPADPLSR LGLGLAKIRQ GDLKAGTEEI EIAASLDPNN SLIRSYLGKA YYDQKRGEAA ATELAIAKEL DPNDPTPWFY DAIRKQTTNR PVEALHDMQK AIELNDNRAV YRSRLLMDQD LAARSASLGR IYNDLGFQQR GLLEGWKSVN TDPSNYSAHR LLADNYAALP RHEIARVSEL LQSQLLQPLN LTPVQPSLAE SNLLLLEGAG PSGLAFNEFN PLFTRNRLAL QASGVFGSND TLGDEVTQSG LWKNFSYSVG QFHSETDGFR ENSDFARNTY NVFTQGALSP NTNLQAEFRH DERIQGDLAL RFDPNFSKVL RETSRVNTYR LGARHAFSPN SQIIASLSYQ NVNVKQKTQT QRTISIPTPL GPLETEILIP TEATINRNGF IGELQHFYTN EKATVISGFG HINNDVIQNV TFPENKPPLT EVITHPDIRK VNIYNYSQIR AFDKLTAILG LSIDSLEIRG QLDKTQVNPK FGLIWMLHSS TTLRLAGFRS MSTTRTANQT IEQTQVAGFN QFFDDVNGTD AWRYGAAVDH VFSKYFYGGV EYSERKLDVP VLISQGSKAQ FVNWKEKTSR TYFYLTPNSN FAASIEYFFE RFDRRSNPLR TGIVDVATHR VPVGLSFFHP LGFSANLKAT YVNQSGFFQR RNSDDIFNDQ SGFIVVDMSL NYRLPKRFGI IRVGSKNLFN ERFKYQDMDP NMPLFFPERF LYTQLTLAF