Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0511 |
Symbol | |
ID | 3706682 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 551973 |
End bp | 553718 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 52% |
IMG OID | 637737020 |
Product | TPR repeat-containing protein |
Protein accession | YP_342564 |
Protein GI | 77164039 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG5010] Flp pilus assembly protein TadD, contains TPR repeats |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 24 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAAACAGG CTCCTTTCCG ATTAACCGCT GGGTTAGTAG TGTTGCTGCT AGCAGGCTGT GCTACGCAGG AAATGCGCCC AGCGGTTGCC GCTAAGGCTG AAGAGAATAC CACGGGACCG GCGACAGCAA CGGCAATCCC GGAGACAAGG TATCCAGATG TTGAATTGAC CCCTGCCCTG CTGTATCAAC TGCTTTCTGC TGACATTGCG GGCCAGCGCG GTCAAATTGG CTATGCGATG GAGGTTTATT TGCAAGCAGC GGGAGAAACT CGCGATCCGC GCCTCGCGGA GCGGGCAACC CGTATCGCCC TCTTTGCTCG GGATATCAGC GCGGCTACCC AGGCAGCCAG GTTATGGGTG AGGGCGGACC CGAGCAATGG GGATGCGCGC CAAGCTTTAG TCTCTTTGCT CCTCGGGCAA CAGCAATACA ATGAAGCTGA AAATCACCTA GAGCATTTGG TTGCCCTAAG CCCGGAAAGC GGTGAACGTA CTTTTTTAAA GATTGCTACC ATGTTGGCTG GGAGCGCAGA CCCAGAAACA GCATTGGCGT TAATGGGCAA TCTATCCGCC TTTCAAGCCA ACGATCCAGA TGCCCTCTAT GGGTATGCTT ACCTTGCCTT GCAGCTAAAG CAGTTGGATC TCGCCCTGAG CACGGTAGAG CGGGTAATTA TCCAGCGTCC CGAAGCGGAT AGGCCCCTGA TCATGCGGGC TAGAATTCTC CAGCAGCAGG GGCGGGAGGA AATGGCCCTC GAATCGCTGG AAACGGTGAT TGAAAATGAA GAGGCGAGTA TTCCTTTACG TTTAGCCTAT GGGCAAATGT TGATGGAAGC CGGTCAAGTA GCTAAGGCTG AGCGTTTGTT CGAGCAGCTA GAACAAGCGC AGCCCGAGAA TCCAGACGTC CTGTTGGCCC AAGGCTTATT AGCTATGGAA CGGTTGGAGT ATAAGCCGGC GGAAGACTAT TTTCAGCGTT TGCTAAAATT AGGCCAGAAT GTAGACCAAG CTCGTTTTTA TCTAGGGCGG TTAGCTGAAC TACAAAGTAA TGCTGGTAAG GCCATTGATT GGTATGCCTC AATTACGGGC GGCAGGTTGA TGGTGGATGC CCAGGTTCGC CAGGCGGTAG TGACAGCGCA ACAGGGCAAT CTACCGGCCG CCCGTCAACA TTTGCAATTA TTAAGTAGAA AATTCCCCAC TCAGGCCGAT CGGTTTCAGC TTGCAGAAGG TGAGATTCTT ATCAATGCTG GGCGTTATGA GGAGGCGATG AGCCATTATG ATAATGCGCT GCACAGCCGC CCAGATGATA CTAATTTGCT CTACGCTCGC GCTTTGGTGG CAGAAAACCT AGGCCGGCTG GATATTGCTG AACAGGATTT GCAGCGGGTT ATAACCTTAG AGCCGAGTAA TGCCGAGGCC CTCAATGCGC TTGGCTATAC CTTAGCGGAT CGAACGAGAC GTCTTGAGGA AGCGCTACGC TATATTTCCC GGGCAATGAG GTTAAAGCCT AATAATGCGT TTATTCTTGA TAGTATGGGC TGGGTACATT ACCGCTTGGG AAATTACGAC AAAGCGGAGA AATATTTGCG GGAGGCGATG GAGCTCCGCA AAGACCCGGA AATCGCGGCC CATTTGGGTG AAGTGCTGTG GGCAAAGGGT GACAGGGAGG GTGCCCGCCA GGTATGGCAA CATACGCTTA AAATGAACCA AGGCAATAAA GTGCTTTTGG AAGTTATGCA GCGCTTTCAG GAATGA
|
Protein sequence | MKQAPFRLTA GLVVLLLAGC ATQEMRPAVA AKAEENTTGP ATATAIPETR YPDVELTPAL LYQLLSADIA GQRGQIGYAM EVYLQAAGET RDPRLAERAT RIALFARDIS AATQAARLWV RADPSNGDAR QALVSLLLGQ QQYNEAENHL EHLVALSPES GERTFLKIAT MLAGSADPET ALALMGNLSA FQANDPDALY GYAYLALQLK QLDLALSTVE RVIIQRPEAD RPLIMRARIL QQQGREEMAL ESLETVIENE EASIPLRLAY GQMLMEAGQV AKAERLFEQL EQAQPENPDV LLAQGLLAME RLEYKPAEDY FQRLLKLGQN VDQARFYLGR LAELQSNAGK AIDWYASITG GRLMVDAQVR QAVVTAQQGN LPAARQHLQL LSRKFPTQAD RFQLAEGEIL INAGRYEEAM SHYDNALHSR PDDTNLLYAR ALVAENLGRL DIAEQDLQRV ITLEPSNAEA LNALGYTLAD RTRRLEEALR YISRAMRLKP NNAFILDSMG WVHYRLGNYD KAEKYLREAM ELRKDPEIAA HLGEVLWAKG DREGARQVWQ HTLKMNQGNK VLLEVMQRFQ E
|
| |