Gene Noc_1459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1459 
Symbol 
ID3706028 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp1615646 
End bp1618957 
Gene Length3312 bp 
Protein Length1103 aa 
Translation table11 
GC content59% 
IMG OID637737948 
Productpeptidase M28 
Protein accessionYP_343477 
Protein GI77164952 
COG category[E] Amino acid transport and metabolism
[R] General function prediction only 
COG ID[COG0308] Aminopeptidase N
[COG2234] Predicted aminopeptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.246557 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTACCGGC AAAAAATATT ATCTTTCCTC GTTTTTGGGT TGGTAGTGCT GGCCTCAGGG 
ACCGCAAGCG CCCAAATTCA CCATCAACTG CATATTAACT TGGACCCGGA CAGTCATCGT
CTCACGGCGG TAGATACCCT TACCCTGCCG AAGGAGATAT CCTCTCCGGT AACTTTCCGC
CTTCACTCGG GCCTCCAGCC TCAAGCCAAC AACCCAGAGG TACGGCTTCG GCGAGTGAGA
AAAACCCAAT TTACCGAGGA CTATGCGGTC GATCTTCCCC CGGGAGTCCG CCAGTTCACC
TTGGAGTACA GCGGTGAAAT CTTTCATCCT ATTGCCCCCA TCAGCGAGGA ATATGCCCGC
AGCTTTAGCG CCTCGCCGGG CCTCATTGCC CCCGAAGGGG TTTTCTTGGC CGGTCCCAGT
TACTGGTATC CCCATTTTGG CGAGGGCATG GTGACTTTTT CCTTAACCAC CCAGCTTCCG
GCAGGATGGC ACTCGGTCAG CCAGGGGAAG CGCTCCTCCC CCCCTGAACA GGAGGGAAGC
ACCGAGGAAA CCTGGACCAT GGATCACCCC CAGGAAGAAA TCTATCTCAT AGCAGGTTCC
TTTACCCCAT ACCGGGAACA AGCCGGTCAG GTGGAAACCC TGGCGTTTTT GCGCCAGCCC
GATCCCGCCC TGGCCCGCAA GTATCTCGAT GCTACTGCCC AATATATCGA GATGTACCGC
CAATTGATTG GCCCTTATCC TTACCAAAAA TTCGCCCTGG TGGAGAATTT CTGGGAGACG
GGCTACGGGA TGCCCTCCTT TACCCTGCTG GGGTCCCGGG TCATCCGCTT TCCCTTTATT
CTCCGCTCCT CCTATCCCCA TGAAATTCTT CATAACTGGT GGGGTAACGG AGTCTATGTG
GATTACGATT CCGGCAACTG GGCCGAAGGG CTGACCAGCT ATCTGGCGGA CCACCTGCTC
AAGGAGCAGC AAGGCCAGGG AGCGCAATAC CGGCGGGAAA CACTGCAAAA ATACGCTGAT
TATGTGCGGG AAGGACGGGA TTTTCCCCTG ACCGAATTCC GCAGCCGCCA CAGCGCCGCT
ACCCAGGCTG TGGGCTACGG CAAAACCCTG ATGCTGTTTC ACATGCTGCG CCAGCAATTG
GGTGATTCAG CGTTTATCCA GGGCCTGCAA CACCTTTACC GCCAGCACCG GTTCCAAGTC
ACCAGCTTCC AGGAAGTCGC GGAAACTTTC AACCAAGTAA GTGAGCAACC CCTGCGCGCT
TTCTTTAAAC AATGGGCCGA GCGCACGGGC GCGCCTTCTT TGAGAATCCG GGAAGCACAC
GCCCAACCCC TCGACGAGGG CTATCTACTG ACAGCCACTA TCGAGCAGCT TCAGCCCGGC
AAACCTTATC AATTAGACTT GCCTCTGGTT ATTTATCTGG AAGACGCCGA GGAGGTTTAT
CAAACCCGCC TTTCCATGGA GAAAAAAACC CATTCCCTAA AGCTCCAGCT ACCGGCCCGT
CCCCTGCGGC TGGAGATAGA TCCCCAATTT GATGTCTTCC GGCGGCTCCA CCACAATGAA
ATACCGCCGG CCCTTAGCCA AGCCTTTGGC GCTGAGCGGG CCCTGGCCGT CCTGCCCTCC
CAGGCCCCCC AAGCGGTGCG GGAAGGCTAC GCCGCCATGG CCCGAAGCTG GCAGCGGGGA
CGGGAAAATT TGGTAATCAC CACCGATGCG GAGCTGGACG CGCTACCCAC GGATCGCGCT
GTCTGGCTGT TCGGGTGGGA AAACCGCTTC CGTTCCCAGT TTAATGAGGC GCTGAGCGAT
TATGCTTACC AGGCGGATAA AAACGGCCTC AGCCTGGAAA AGAACAAACT CCAGCGGCAA
CAACACTCGG CTGTGGTGGT CACTCGCCCG CGGAATAATC CCGATCAGGC TCTGGCCTGG
ATAGCCACTG ATAACGTGGC CGCCATGCCA GGGCTGGCCC GCAAACTCCC CCACTACGGC
AAATACAGCT ATTTGGGTTT CACCGGCGCC GAGCCGGAGA ATAGCCTAAA GGGCCAGTGG
CCGGTGGTGA ACTCTCCCAT GTCGGTTCTA CTGGCCGAAG ATTCGCCTCC CCTCCAGGCG
GAACTGCCCC CGCGCAAGCC TCTGGCGGAA CTTCCTCCCC TCTTTAAAGC CCAGCGAATG
ATGCAGGATA TCGCTTATCT GGCTGATCCC AAGCTAGCGG GCCGGGGACT GGGCACGCCG
GAGCTGGACC AAGCGGCCCA GTATATTGCC GATAAGTTCC AGGCGGCGGG ACTCAAACCC
GGCGGCGACG AGGGAAGCTA CTATCAGACC TGGACCGCAA CGGCCGGCGA ACCGGAACGA
ACCATCACCC TCCGTAACGT GGTGGGTTTG TCCCCTGGCG CCCGGCCGGA ACTCCCCCCG
GTAGTCGTGG GCGCCCACTA TGACCATTTG GGCCGGGGCT GGCCCGATGT ACACCAGGGC
GATGAAGGCA AAATTCATCC GGGAGCCGAC GATAACGCCA GCGGCATTGC CGTCATGCTG
GAGCTAGCCC GGATTTTGGG ACCCCAATGG CAACCCGAGC GCACGGTAGC GTGGGTCGCC
TTTACTGGGG AAGAAGCCGG CAAGCTGGGC TCGGTCCACT ATGTCCAGCG CTTGGGCGAC
TCGCCTGCAA AAACCACCAT GGCCATGATC AACCTGGATG CGGTGGGCCG TCTTCATGAC
GGCGAACTCA TGGTGCTTGC CGCTGATTCA GCCCGGGAGT GGGCGCATAT TTTTCGGGGA
GCGGGCTTTG TCACTGGCGT CCCTATTCAA ACCGTGGCCC AGGATATCGG CTCCAGTGAT
CAAACGTCTT TCCTCAACGC CGGTATTCCC GCCGTGCAGC TTTTCACGGG ACCCCATGGT
GATTTCCATC GCCCCACGGA CACCCCTGAT AAGATTGATT CTGCGGGCTT GACCAAAATC
GCTGCCGTGC TAAAAGAGGC GGTAGCGTAT CTAGCCTCCC GCCCTGACCC TTTACACGGC
CAATCAGTAG CCCCGGGAGA AGAGGCTCAG GATTCATCCC GTCAGGGCCG TCGCGTCGGC
TTGGGCACGA TACCGGATTT TGGCTGGACC GGCACAGGCG TGCGGATCTC CGGGGTTACG
CCCGATACAC CCGCGGAGGC GGCGGGGCTG CAAAAAGATG ATATTATTAT ACGGCTCAAT
GATAAGACTA TTGACACCCT GGCGGACTTC GCCGGCGTCC TACGCGCCCT CAAAGCGGGC
GATTCCTTAA CCATCGAATT TCTGCGCGAT CAGCAGTCCC GGACGGTGAC GACTCAGGTG
GTCGCGCGAT GA
 
Protein sequence
MYRQKILSFL VFGLVVLASG TASAQIHHQL HINLDPDSHR LTAVDTLTLP KEISSPVTFR 
LHSGLQPQAN NPEVRLRRVR KTQFTEDYAV DLPPGVRQFT LEYSGEIFHP IAPISEEYAR
SFSASPGLIA PEGVFLAGPS YWYPHFGEGM VTFSLTTQLP AGWHSVSQGK RSSPPEQEGS
TEETWTMDHP QEEIYLIAGS FTPYREQAGQ VETLAFLRQP DPALARKYLD ATAQYIEMYR
QLIGPYPYQK FALVENFWET GYGMPSFTLL GSRVIRFPFI LRSSYPHEIL HNWWGNGVYV
DYDSGNWAEG LTSYLADHLL KEQQGQGAQY RRETLQKYAD YVREGRDFPL TEFRSRHSAA
TQAVGYGKTL MLFHMLRQQL GDSAFIQGLQ HLYRQHRFQV TSFQEVAETF NQVSEQPLRA
FFKQWAERTG APSLRIREAH AQPLDEGYLL TATIEQLQPG KPYQLDLPLV IYLEDAEEVY
QTRLSMEKKT HSLKLQLPAR PLRLEIDPQF DVFRRLHHNE IPPALSQAFG AERALAVLPS
QAPQAVREGY AAMARSWQRG RENLVITTDA ELDALPTDRA VWLFGWENRF RSQFNEALSD
YAYQADKNGL SLEKNKLQRQ QHSAVVVTRP RNNPDQALAW IATDNVAAMP GLARKLPHYG
KYSYLGFTGA EPENSLKGQW PVVNSPMSVL LAEDSPPLQA ELPPRKPLAE LPPLFKAQRM
MQDIAYLADP KLAGRGLGTP ELDQAAQYIA DKFQAAGLKP GGDEGSYYQT WTATAGEPER
TITLRNVVGL SPGARPELPP VVVGAHYDHL GRGWPDVHQG DEGKIHPGAD DNASGIAVML
ELARILGPQW QPERTVAWVA FTGEEAGKLG SVHYVQRLGD SPAKTTMAMI NLDAVGRLHD
GELMVLAADS AREWAHIFRG AGFVTGVPIQ TVAQDIGSSD QTSFLNAGIP AVQLFTGPHG
DFHRPTDTPD KIDSAGLTKI AAVLKEAVAY LASRPDPLHG QSVAPGEEAQ DSSRQGRRVG
LGTIPDFGWT GTGVRISGVT PDTPAEAAGL QKDDIIIRLN DKTIDTLADF AGVLRALKAG
DSLTIEFLRD QQSRTVTTQV VAR