Gene Noc_0289 details


Gene Information

Locus tag: Noc_0289
Symbol:
ID: 3706460
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Nitrosococcus oceani ATCC 19707
Kingdom: Bacteria
Replicon accession: NC_007484
Strand:
Start bp: 315951
End bp: 317456
Gene Length: 1506 bp
Protein Length: 501 aa
Translation table: 11
GC content: 55%
IMG OID: 637736804
Product: leucyl aminopeptidase
Protein accession: YP_342348
Protein GI: 77163823
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0260] Leucyl aminopeptidase
TIGRFAM ID:
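
The start, end, gene length and protein length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the last codon of the CDS is a stop codon; the function name `cds_lengths` is hypothetical and not part of the record.

```python
def cds_lengths(start_bp: int, end_bp: int) -> tuple[int, int]:
    """Return (gene length in bp, protein length in aa) for an unspliced CDS."""
    # Assumption: coordinates are 1-based and inclusive on both ends.
    gene_len = end_bp - start_bp + 1
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Assumption: the final codon is a stop codon and is not translated.
    protein_len = gene_len // 3 - 1
    return gene_len, protein_len


print(cds_lengths(315951, 317456))  # (1506, 501)
```

The result agrees with the 1506 bp and 501 aa values listed in the record.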


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value: 0.823293
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAATTTTC ACGTTACCAG TGGGACCCCC GAGAAACAAC GCACCGCTGC CCTTGTGGTG 
GGCATCTATG AAGACGAAAA ACTCTCTTCC TATGCCCAGC GGATTGATAA AGCCAGCGAA
GGTTATGTGT CTCGGCTTAT CAAGCAAGGA GATTTTACCG GCAAAAAGGG ACAAGCCCTT
CTGCTTTTTG CTCTCCCAGG CGTTAAAGCC GAGCGGGTTT TACTGATGGG ATGTGGCCAG
AAGGACAAGG TAACGGCCAA GAATTTACGC CAGAGCTGGT CGGGCGCCGT CAAGGCGCTA
CAAGCCTGTG GCGCTACCGA GGCAATGATC TGTCCGCTGG AAGCGAAGCC CAAGGACGAG
GAACTTACCC AATGGGCGCG GCTCATCGTA GAAACGGCTG AACAGGCTTT ATATCGTTAC
GAACACACTA AGAGCAAAAA GGAATCCTTA AAAAAGCCGC TCGCCAAGCT CACTTTGCTA
TTGGATCAAC GCTCCCAACA ACCACTAGCG GAACAGGGTA TCCAGCAAGG TCAGGCCATT
GCCAAAGGTG TTAACCTGGC CCGGGACTTG GGCAATCTAC CGGGGAATAT TTGCACGCCT
ACTTATTTGG CCGACGAAGC CCGCCGATTA GCCAAAGAAT ACAAGTCATT AAAGGCAAAA
ATCCTGGAGC AAGCCGAGAT GGAAAAGCTC GGGTTAGGAG CGCTGCTTGC CGTATCCCGG
GGCAGCCGGC AGCCGCCCAA GCTCATTACC CTGGAGTATA AAGGCGCCCC CGGCAAGCAA
AAACCCATTG TGCTGGTAGG TAAGGGATTG ACTTTCGATG CAGGCGGCAT CTCCATCAAG
CCTGGGGAAC GCATGGACGA AATGAAATAC GATATGTGTG GCGGGGCAGG CGTTTTAGGG
ACGATGCAAG CTTGCGCCGA GCTGGAATTG CCCCTCAACG TGATTGCCGT CGTACCCAGC
TCCGAAAATC TCCCCGACGG CGCGGCTAAC AAACCGGGAG ACGTGCTTAC CAGCTTATCG
GGCCAAACCA TCGAGGTGCT CAACACAGAT GCCGAGGGCC GTTTGATTCT CTGTGATGCG
TTGACCTACA GCAAACGCTA CCGGCCTGAT GTGGTCATTG ATGTGGCGAC CCTCACGGGA
GCCTGCGTGA TTGCCCTGGG TGCCCATGCC AGTGGTTTAC TGAGCAACGA TCAGAGCCTG
GCGGAGCACT TGCTCGCCGC TGGACAAACC AGTGATGACC GTGCTTGGCA GCTTCCCCTC
TGGGACGATT ACCAGCAGCA GCTGGACAGC AATTTTGCGG ATATGGCCAA CATTGGCGGC
CGGGGGGCCG GCACTATTAC CGCGGCTTGT TTCCTGGCCC GCTTTACCGA AGAGTTTCGC
TGGGCCCATT TGGACATTGC CGGTACCGCC TGGCTCAGCG GTAAAGAAAA AGGGGCAACC
GGACGCCCGG TGCCTTTGCT CACCCAGTAT CTTATTCAAC GCGCCCAAGA AGCGAAAACT
TCATAA
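
A GC content figure like the one in the record can be computed from a sequence laid out in 10-base blocks, as above. This is a minimal sketch, assuming GC content means the percentage of G and C bases over the whole gene sequence; the record does not say how its 55% value was derived, and the helper name `gc_percent` is hypothetical.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    # Remove the spaces and line breaks used in the blocked layout.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(base in "GC" for base in bases)
    return 100.0 * gc / len(bases)


# First 10-base block of the gene sequence only, as a small example.
print(gc_percent("ATGAATTTTC"))  # 20.0
```

Passing the complete gene sequence instead of the first block gives the figure to compare with the reported GC content.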
 
Protein sequence
MNFHVTSGTP EKQRTAALVV GIYEDEKLSS YAQRIDKASE GYVSRLIKQG DFTGKKGQAL 
LLFALPGVKA ERVLLMGCGQ KDKVTAKNLR QSWSGAVKAL QACGATEAMI CPLEAKPKDE
ELTQWARLIV ETAEQALYRY EHTKSKKESL KKPLAKLTLL LDQRSQQPLA EQGIQQGQAI
AKGVNLARDL GNLPGNICTP TYLADEARRL AKEYKSLKAK ILEQAEMEKL GLGALLAVSR
GSRQPPKLIT LEYKGAPGKQ KPIVLVGKGL TFDAGGISIK PGERMDEMKY DMCGGAGVLG
TMQACAELEL PLNVIAVVPS SENLPDGAAN KPGDVLTSLS GQTIEVLNTD AEGRLILCDA
LTYSKRYRPD VVIDVATLTG ACVIALGAHA SGLLSNDQSL AEHLLAAGQT SDDRAWQLPL
WDDYQQQLDS NFADMANIGG RGAGTITAAC FLARFTEEFR WAHLDIAGTA WLSGKEKGAT
GRPVPLLTQY LIQRAQEAKT S
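
The protein sequence corresponds to a codon-by-codon translation of the gene sequence. The sketch below assumes that translation table 11 assigns amino acids to codons in the same way as the standard genetic code (the tables differ in which codons may act as start codons), which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codons enumerated in T, C, A, G order (first base varies slowest),
# paired with the amino acids of the standard code; "*" marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    # Remove the spaces and line breaks used in the blocked layout.
    bases = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence (60 bases, 20 codons).
first_line = "ATGAATTTTC ACGTTACCAG TGGGACCCCC GAGAAACAAC GCACCGCTGC CCTTGTGGTG"
print(translate(first_line))  # MNFHVTSGTPEKQRTAALVV
```

The output matches the first 20 residues of the protein sequence listed above.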