Gene Noc_1987 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoc_1987 
Symbol 
ID3704871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNitrosococcus oceani ATCC 19707 
KingdomBacteria 
Replicon accessionNC_007484 
Strand
Start bp2282999 
End bp2285803 
Gene Length2805 bp 
Protein Length934 aa 
Translation table11 
GC content51% 
IMG OID637738463 
ProductTPR repeat-containing protein 
Protein accessionYP_343979 
Protein GI77165454 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG2956] Predicted N-acetylglucosaminyl transferase 
TIGRFAM ID[TIGR02917] putative PEP-CTERM system TPR-repeat lipoprotein 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0465352 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCGAGCC CACTAATAAG CCCTATGATC CTAATCTGCT CACTATTATT AGCAATTGCC 
CTGTTAACAG GCTGCAGCGG GGATTCCAAC CTCACCCCCG AAGAGCACAT CTCACGCGCC
AAGGAATATC AGGATCAGGG AAAAATAAGG GCCACCATTA TCGAATTGAA AAATGCCCTC
CAGAAAACAC CAGACAATCA AGAGGCGCGC TGGCTTCTAG GTCAAACCTA CGTGAAAGCG
GGTGATGGGC CATCCGCCGA AAAAGAACTA AAGCGAGCTC TCAGCCTCGG GTTAGCCTCC
GAAGCTGCGG CTATTTATCT AACCCGCGCT GCCCTCTTGC AAAGAGAATT TCAAACTGCC
ATTGAAACCT CGACCGATTA CCCTGCCCTT CCCGAAGATG AACAGGCAGA GCTGCTAGCC
CTACGGGGAC ATGCCTATCT TGGACTACGA GAACTTGAAA AAGCAGAAAA ATCATACGAA
TCCGCCCTTT CCATTAACCC TGCTGCCCCG GAGGCCGGTT TTGGAAAAGC ACGAATCGCC
GCAGTGCAAA ACCGCCTGGA AGAGACCCGG CAATGGCTGG AAAAAGTCCT TCAGACTACG
CCAAGTTTCG CTCCCGCCTG GAGTCTGCTA GGTGACCTAG AGCGTTATCA AGGAAACGGG
GAAGCCGCTG AACAAGCCTA TGGGAAAGCC ATTGCCCACC GCTTCAATAA TGCCAGCGAT
CTGCTCAACC GGGCATTGGT GCGCATTTAC TTGAAAGACT ATGAGGGCGC GGCCAGCGAT
TTGGAAACGC TGAGTAAACG CGCGCGCAAT CATCCAGGAG TAACCTACGC CCAGGGATTA
TTACATTTCC AGCAGCAGCA GTACGCTGAC GCTTTAACAA GCTTCCAGAA AACACTAAGC
AAAAATCCAG AATACATGCC TGCAGTATTC TACGCCGGTA TCGCCTATTA CCAACAAGGA
CAGTTAACGC AAGCTGGACA GCTCTTAAAT CAGTTCCTAA AGCGTTTTCC CCATTCCGAT
ACTGCGGCTA AGACTCTGGC CATGATACGT CTTCGCGAGG GCAACTATAC GAGCGCCCAA
GCTATCCTAG AGCCTATTAT AGCCCAAAAT CCCAATGATA CTGCCGCCCT GGATTTACTA
GGAAGCGCAA TCCTAGGGCA AGGCAAGCCT GAAAAGAGCG CTGCCTACTT TCAAAAAGTC
ACCGCGCAAA CGCCAGAGTC GGCGGCAGCC TACATGAAAC TAGGGCTTGG CTTCATGATG
TCCGGGGAAC ATGAGCAAGG CATTGGCGCT CTGGAAAAAG CCATCGAATT AGACTCTCAA
CTCCCCCAGG CCGACCGGTT AATAATCCTT GGCCACTTGC GGGCCCAGGA ATTCGATAAA
GCTCTTGCAG CGGCTAAACG ACTGAGAGAG AAACAACCAG ACAGCCCCCT GCCCATAAAT
CTAATTGGCG CCGCCTATCT TGGCAAAGGA GAAGAAAGCA AAGCCCAGGA GGCATTTCGC
CAAGCTTTAG AAATCGCTCC AGGTGATCCA TCAGCGACTC ATAATCTTGC CATGCTGGCC
ATCAAAAAGG GAAATATTGA GAAGGCCCAC GCTCTCTACC AAGAAGCGCT CAGATACCAT
CCGGGTCATC TCAGAACATT GCTCAAGCTC AGTGCGCTAG AAGCACAGCA GGGCCATCCG
GAGAAAGCAA AGAACTGGGT AGAGCAGGCT ATGGAGAAAA ATTCTAAAGC CCTAGAACCT
CGGGTATTGC TAGCCCGTTA CTATTTGGAG CAAGGTAGGC CCGCCCGGAG TCTTGCTATC
ACGCGGGAAA TCCAGGATCT TTATCCTGCT CATCCGGCAT TATTGCTCGT GGTGGGAACG
GCACAACTGG AAAACAGCCA GCTACGGGAT GGGGTAAAAA CCTTTCAGAA ACTGGTGGAG
GTCCAGCCCC AGTCGGCCCA AGCCCACTAC CTTCTGGCCA AAGCTTATGC CACGGTAAAT
AATACGGATA AGCTCCGTAA AGAGCTCGAA CAAGCGCTTA AACTGAACCC CAACCATACT
CTTTCTAAAA TCGCCATGAC TCGCCTGCTG ATGCAGGAGA ATCAACCTGA AGCGGCTAAT
AAGCTATTCC AGGAATTAAA ACAGGCCTAC CCAGAACATC CGGAAGTGCT GGCCCAGGAA
GGCTGGTTGG CCATGCGTCA GAACCGGCCT CAGGACGCAA TAATAGCCTT CAGGGAAGCA
CTAAAACGCT CCCCCACCAG TCAAATCATC GTCAATCTTG CGCACGCCCA ACTCCAGGCG
GGGAATCAAA ATGAGAGCTT GGCGACCCTA GAGGATTGGT TGAAAAAGCA TCCGGAGGAT
ATGGTAGTGC AGTATAATTT GGCTAATCTC TATCTAGCAC TTAAACAAGA ACAGAAGGCA
GCGTCAGCTT TTACTACAGT AGTCAAGCGA GCGCCAGATA ATGTGGTGGC CCTTAATAAC
CTGGCCTGGC TGCTGCGCAA AAACGATCCG GCCAAAGCTT TAGAATATGC TGAGCGGGCT
TTGGAGCTGG CTCCTAATGC ACCACCCGTC ATGGACACCT TGGGGATGTT GCTTTTAGAA
AAGGGTGAAG CAAAACGCAG CCTACGCTTG CTCAGGAAAG CCTCGGACAG GGCGCCTGAG
AACCTAACTA TTCGGTACCA TTTTGCCTTG GCTCTGGCGC AGAATGGAGA AAGTGCCCAG
GCCCGGCAAG TGCTCGATGG GATCTTAGAT GCAAAGCAAC CCTTTGCCAA GAAAAAGGAA
GCCCATGCGC TGCGTCAAAC CTTAAGCAAA TCACTAAATG ATTGA
 
Protein sequence
MPSPLISPMI LICSLLLAIA LLTGCSGDSN LTPEEHISRA KEYQDQGKIR ATIIELKNAL 
QKTPDNQEAR WLLGQTYVKA GDGPSAEKEL KRALSLGLAS EAAAIYLTRA ALLQREFQTA
IETSTDYPAL PEDEQAELLA LRGHAYLGLR ELEKAEKSYE SALSINPAAP EAGFGKARIA
AVQNRLEETR QWLEKVLQTT PSFAPAWSLL GDLERYQGNG EAAEQAYGKA IAHRFNNASD
LLNRALVRIY LKDYEGAASD LETLSKRARN HPGVTYAQGL LHFQQQQYAD ALTSFQKTLS
KNPEYMPAVF YAGIAYYQQG QLTQAGQLLN QFLKRFPHSD TAAKTLAMIR LREGNYTSAQ
AILEPIIAQN PNDTAALDLL GSAILGQGKP EKSAAYFQKV TAQTPESAAA YMKLGLGFMM
SGEHEQGIGA LEKAIELDSQ LPQADRLIIL GHLRAQEFDK ALAAAKRLRE KQPDSPLPIN
LIGAAYLGKG EESKAQEAFR QALEIAPGDP SATHNLAMLA IKKGNIEKAH ALYQEALRYH
PGHLRTLLKL SALEAQQGHP EKAKNWVEQA MEKNSKALEP RVLLARYYLE QGRPARSLAI
TREIQDLYPA HPALLLVVGT AQLENSQLRD GVKTFQKLVE VQPQSAQAHY LLAKAYATVN
NTDKLRKELE QALKLNPNHT LSKIAMTRLL MQENQPEAAN KLFQELKQAY PEHPEVLAQE
GWLAMRQNRP QDAIIAFREA LKRSPTSQII VNLAHAQLQA GNQNESLATL EDWLKKHPED
MVVQYNLANL YLALKQEQKA ASAFTTVVKR APDNVVALNN LAWLLRKNDP AKALEYAERA
LELAPNAPPV MDTLGMLLLE KGEAKRSLRL LRKASDRAPE NLTIRYHFAL ALAQNGESAQ
ARQVLDGILD AKQPFAKKKE AHALRQTLSK SLND