Gene Noc_1063 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1063 
Symbol 
ID3707246 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1169176 
End bp1172565 
Gene Length3390 bp 
Protein Length1129 aa 
Translation table11 
GC content49% 
IMG OID637737568 
ProductTPR repeat-containing protein 
Protein accessionYP_343101 
Protein GI77164576 
COG category[R] General function prediction only 
COG ID[COG0457] FOG: TPR repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.194908 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCGCTTAA GATTTTCTGT GCTAGGGATA GCGCTTTTTG TTTTTGGGTT TTCCTGGGTA 
GGGGCGGATG AGCGTTGCGC CTCGCCGGTG GCGCAAATTG TTTCCCTCCA GGGGCGGGTG
GAGGTCACTC CAGTGGACGA AAGGCGTTGG CGATCGGTGG GGTTGCGAGA GAAATTTTGC
GCCGGAGACA GGATTCGCAT CGAGGCTTAT AGTCGTGCCT TGGTGCAGCT TCAAGATAAT
ACTCTCCTGC ACCTGGATGG GGGTACCCTA GTGACTTTTT CAGGCATTGA ACCTAACAAG
CCTTCCTGGT TTGAACTCCT GAAGGGCGCT ATCCACTTAA TCAGCCGTTT TCCCCATCGG
CTGGAAGTCA AAACGCCCTT TGTGAATGCG GCGGTGGAAG GCACTGAATT TGCGATTCGG
GTAGAACCGG AAAAAGCCTT GCTCTGGGTC TTCGAGGGGC GGGTCCTTTT TCACAATCCT
ACTGGCCAGC TCACCGTCAC CAGCGGCGAG GCGGCGGTGG CCGAGGCCGG CCAAGCGCCG
CGGCGGCGGC TGGTGATCCA ACCCCGGGAA GCAGTGCAAT GGGCGCTCTA TTATCCGCCG
CTCATTGATC TGCGTCCCAG CGTTTATCCA AGCGGTCCAG AAGCCCAAGG AATTCACGTG
GCGTTGCGGG CCTATCGGGA TGGGGATTTG CTCACCGCCC TGGGCCGCTT GGAACAGGTT
CCGATTGGAG CGCGAGAGGC CAGCTATTTC ACTTTGCAGG CGGCCTTGCT CCTAGTGGTG
GGGCGCATCG ATGAAGCCCG CCCCAACATT CAGCGTGCTT TGCAGCTCGA TCCCGACCAC
GGAACCGCTT ATGCCCTGCA AGCCATCATC GCCCTGGCGC AGAACCGAAA AGAGGACGCC
CTTCGGTTGG CCCGGCAGGG GGCTAAGCTC GATCCCCAGT CCTCCATTCC TCAAATCGCC
CTGTCCTATG TTTATCAGGG AAGGTTTAAT ATCGAGCAAG CGTTACAACA TGCCCAGCAG
GCTACCGAGC TTTTCCCTGG CGAGGCCCTT GCCTGGGCGA GGGTAGCGGA ATTACAACTC
TCCCTGGGTG ATTTGGATGG AGCCGCCAAA GCCGCCCAAC AGGCGGTAGC CCTCGACCCG
GATTTAGCTC GGACCCAAAC CGTGCGGGGC TTTGCCGAGT TGACGGCCAT TGATATTGAG
GAAGCCAAAG CGAGTTTTCA GCGGGCCATT GAACTGGACC CCGCCGATCC CCTCTCCCGC
CTTGGGTTGG GACTGGCCAA AATCCGCCAG GGGGATCTCA AGGCAGGCAC TGAGGAAATC
GAAATTGCCG CCAGCCTGGA CCCCAATAAT TCGCTGATTA GAAGTTATCT CGGCAAAGCC
TATTACGACC AGAAACGGGG AGAGGCGGCC GCAACGGAGC TGGCGATAGC CAAGGAACTC
GATCCTAACG ATCCCACCCC CTGGTTCTAC GACGCCATTC GCAAGCAAAC GACGAATAGG
CCGGTGGAAG CCTTGCATGA CATGCAAAAG GCCATTGAAC TGAATGATAA CCGGGCGGTG
TACCGCTCGC GATTACTTAT GGACCAAGAT CTGGCCGCAA GAAGTGCGAG TCTCGGACGC
ATTTACAACG ATTTGGGCTT TCAGCAACGG GGCCTATTGG AGGGATGGAA ATCGGTTAAC
ACTGATCCCA GCAACTACTC CGCCCATCGG CTATTGGCGG ACAACTATGC GGCATTACCA
AGGCATGAAA TTGCCAGGGT AAGTGAGTTA TTGCAATCCC AACTTTTGCA ACCGCTTAAC
CTCACGCCCG TGCAGCCAAG CTTGGCAGAA AGTAATCTGC TTCTTTTAGA AGGTGCCGGT
CCTTCAGGGC TTGCGTTTAA TGAGTTTAAC CCATTGTTCA CGCGCAATCG CCTAGCCTTG
CAGGCATCCG GTGTTTTTGG CAGCAATGAT ACCTTGGGTG ATGAGGTGAC TCAGTCGGGG
CTGTGGAAAA ATTTCTCTTA TAGTGTGGGA CAGTTTCATT CTGAAACGGA CGGATTTCGG
GAAAATAGCG ACTTTGCGCG AAACACTTAT AATGTATTTA CTCAAGGGGC TTTATCTCCT
AATACAAACT TGCAAGCGGA ATTTAGACAT GATGAGCGCA TACAAGGAGA TTTAGCTCTT
AGATTTGATC CAAATTTTTC CAAAGTTCTT CGCGAAACCA GTCGTGTCAA CACATACAGA
TTAGGAGCGC GGCATGCTTT TTCGCCTAAC TCGCAGATTA TAGCTTCATT AAGTTATCAA
AACGTTAATG TTAAGCAAAA AACACAAACC CAAAGAACCA TTTCCATTCC TACCCCACTT
GGCCCGTTAG AAACAGAAAT TCTGATTCCT ACAGAAGCTA CGATTAATAG AAATGGATTT
ATAGGCGAAT TACAACATTT CTATACTAAT GAAAAAGCCA CGGTAATTTC TGGTTTCGGT
CATATTAACA ATGATGTTAT TCAAAATGTC ACTTTTCCTG AGAATAAACC GCCTTTGACT
GAAGTTATTA CCCACCCAGA TATAAGAAAA GTTAACATTT ATAATTATTC TCAGATCCGT
GCTTTTGATA AGTTAACTGC AATACTGGGG TTAAGCATAG ATTCTTTGGA AATAAGAGGT
CAACTCGATA AAACTCAAGT GAACCCAAAA TTCGGTCTAA TTTGGATGCT GCATTCTTCA
ACGACTCTTC GGCTGGCAGG GTTTAGAAGC ATGAGTACCA CAAGAACTGC TAATCAGACT
ATTGAGCAGA CACAGGTTGC TGGTTTTAAT CAATTTTTCG ATGATGTAAA TGGGACCGAT
GCTTGGCGTT ACGGGGCTGC GGTTGATCAT GTTTTTTCTA AATATTTTTA TGGAGGAGTT
GAGTATTCAG AACGAAAATT AGATGTTCCG GTACTTATCT CCCAAGGTTC AAAGGCGCAA
TTCGTTAATT GGAAAGAAAA GACATCACGA ACCTATTTTT ATCTAACGCC AAATTCTAAT
TTTGCGGCAA GTATCGAATA TTTTTTTGAG CGTTTTGACC GGCGCTCTAA TCCGTTACGG
ACTGGAATTG TTGATGTGGC AACACATCGG GTTCCAGTAG GCTTAAGTTT TTTTCATCCT
TTAGGTTTTT CGGCTAACCT CAAGGCCACC TATGTTAATC AATCCGGATT TTTCCAAAGA
CGTAATTCGG ATGATATATT TAATGATCAG AGCGGGTTTA TCGTTGTTGA TATGAGCTTG
AACTATAGAT TGCCGAAAAG ATTTGGAATT ATTAGAGTAG GCTCAAAAAA TTTATTTAAT
GAGAGATTTA AATATCAAGA CATGGATCCA AATATGCCAT TGTTTTTTCC GGAGCGATTT
TTGTACACAC AATTGACTTT GGCATTCTAA
 
Protein sequence
MRLRFSVLGI ALFVFGFSWV GADERCASPV AQIVSLQGRV EVTPVDERRW RSVGLREKFC 
AGDRIRIEAY SRALVQLQDN TLLHLDGGTL VTFSGIEPNK PSWFELLKGA IHLISRFPHR
LEVKTPFVNA AVEGTEFAIR VEPEKALLWV FEGRVLFHNP TGQLTVTSGE AAVAEAGQAP
RRRLVIQPRE AVQWALYYPP LIDLRPSVYP SGPEAQGIHV ALRAYRDGDL LTALGRLEQV
PIGAREASYF TLQAALLLVV GRIDEARPNI QRALQLDPDH GTAYALQAII ALAQNRKEDA
LRLARQGAKL DPQSSIPQIA LSYVYQGRFN IEQALQHAQQ ATELFPGEAL AWARVAELQL
SLGDLDGAAK AAQQAVALDP DLARTQTVRG FAELTAIDIE EAKASFQRAI ELDPADPLSR
LGLGLAKIRQ GDLKAGTEEI EIAASLDPNN SLIRSYLGKA YYDQKRGEAA ATELAIAKEL
DPNDPTPWFY DAIRKQTTNR PVEALHDMQK AIELNDNRAV YRSRLLMDQD LAARSASLGR
IYNDLGFQQR GLLEGWKSVN TDPSNYSAHR LLADNYAALP RHEIARVSEL LQSQLLQPLN
LTPVQPSLAE SNLLLLEGAG PSGLAFNEFN PLFTRNRLAL QASGVFGSND TLGDEVTQSG
LWKNFSYSVG QFHSETDGFR ENSDFARNTY NVFTQGALSP NTNLQAEFRH DERIQGDLAL
RFDPNFSKVL RETSRVNTYR LGARHAFSPN SQIIASLSYQ NVNVKQKTQT QRTISIPTPL
GPLETEILIP TEATINRNGF IGELQHFYTN EKATVISGFG HINNDVIQNV TFPENKPPLT
EVITHPDIRK VNIYNYSQIR AFDKLTAILG LSIDSLEIRG QLDKTQVNPK FGLIWMLHSS
TTLRLAGFRS MSTTRTANQT IEQTQVAGFN QFFDDVNGTD AWRYGAAVDH VFSKYFYGGV
EYSERKLDVP VLISQGSKAQ FVNWKEKTSR TYFYLTPNSN FAASIEYFFE RFDRRSNPLR
TGIVDVATHR VPVGLSFFHP LGFSANLKAT YVNQSGFFQR RNSDDIFNDQ SGFIVVDMSL
NYRLPKRFGI IRVGSKNLFN ERFKYQDMDP NMPLFFPERF LYTQLTLAF