Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_2765 |
Symbol | |
ID | 3705303 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 3135929 |
End bp | 3138184 |
Gene Length | 2256 bp |
Protein Length | 751 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637739241 |
Product | TPR repeat-containing protein |
Protein accession | YP_344742 |
Protein GI | 77166217 |
COG category | [N] Cell motility [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3063] Tfp pilus assembly protein PilF |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGCCCCCGA CCTCTCGATA CGGCCCGCAC CCCAACCGGC CCGCGGCGAA CAAGCAGTTT ACCAATCGGG ATGGGCCGAT TAAAGCCTTT CAAGCGGCCC GCCAGGCCCT CGCACCCGAC CAGCATGATG TGCTGACGTT CCATGGCGTT GGCGGGCAGG GCAAAACAGC CCTGGGGAAG AAATTTAGGC AAATCCTGGA GGAAGAGACG CCCCGTGCCG CGCTTTGGGG CCACGTGGAT TTAGCGGACA CGCACCTGCC TACGCCCGAC CGGATTCTTC TCGAATTACG CCGCTCGCTC CGGCGCTCGG GAAAAATAGA GTTTCCCGCT TTCGATGTCG CCTTTGCCCA TTACTGGCAG CAAGCTTACC CGCATTTAAA ATTTCAGGAC ACCCATCGGG ATTTGCTCAC CGATAAGGAA GGGGTTTTTG CCCACGCTCT GGAGTCGGCG AGTAGCTACG CGGAAGCCTT ACCGGCGGGC ATTGGCCTTA CCATGATAGC GCTGAATTGG GTGCGCCAAC GAATGCGCGA TTTTGATAAC CGCTATCGTC CCGCCCTCAG GGGTTTGGAA ACCTTCAGCC CCACGGAGAT TCTCGACCGC CTTCCTTATT ACCTCGGCCT CGATTTATGC GGCTATCAGC GCCGGCAAAG GGCAAAACAG CTCATTGTTT TTTTAGATAC CTATGACGCC TTATGGGAAA CCTTCACCCA GGGCGCCCGC TGGGAGGTGG ATGCCTGGGC GCGGGAGCTG GTCGCGAGCG CCCCCGGCGT GCTGTTTGTG GTCTTCGCAA GAGAGAAACT CACCTGGGAT ACCCATGAAC CCGATGCGTG GCGGGGATGT CTTCACCATC AGCCCCCCCT TGCGCCGCTT ACTGATGAGG ATGCCGACAC GTTTTTGCGC CAAATTCCGA TAGAAGAGGA GGCCGTGCGC CAGACCATTA TCCAAGCGGC CGAGGGCCAT CCGTTTTATC TCGACCTGGA AGTGGAGCAC TATCTCGATT TAGTGGCTGA CGGCGCAACA CCTCAACCGG AGCACTTTGG CCAGACAAAG CAAGCGATTC TGGCGCGATT TTTACGCTAT CGGGCAGGCG AAGAACAAGC CACCCTCAAG GTGTTATCGG TGGTCCGCTC CTTTGACTTT GAGCTATTGG AAGCGCTCGT CCGAGCGTTC CAGACGGGCT TTCCTCCCAC TGAATTCTTT AATTTTGTAC GCTACTCTTT TGTAGCGCCA GAAAGCGGCG AGCGGTATCG CTTGCATTCG CTGATGCAAG CGCATCTTCT GGAGAGCTTG GAGCCAGCGC TTCAAAAAAA GATCCATGAC TTTGCGTTTA CCTATTATGA CGCGCGATGC CAACCCGAAT CCCCCAAAGA CCTTCAGCCA GCCCATGAAA TCGCGTTAGT CGAAGCTTTT TACCACCGGG ATATGACCGA TCCCCAGGAG GCGGCCGCAT GGATTAACCA ACGGACAACT GTCTTTTACG AAGCGGCCCG GTATTCGTTA GTCGAACCCC TCTACCAACG GGCCTTGGCT ATTCGTGAAC AGATCCTCGG CCCCGACCAT CCCGATACTG CCACCAGCCT CAATAACCTG GCGGAATTGT ATCGCGCCCA GGGGCGCTAT GCCGAGGCCG AGCCCCTCTA TCAACGGGCC TTGGCTATTT GTGAACAGGT CCTCGGCCCC GACCATCCCG ACACTGCTCG AAGCCTCAAT AACCTGGCGG GGCTGTACAA GGCCCAGGGA GACTATGGGC AGGCCGAGCC CCTCTACCGG CGGGCCTTGG CTATTTGTGA ACAGGCCCTC GGCCCCGACC ATCCCCACAC TGCCACCAGC CTCAATAACC TGGCGGGGCT GTACGATAGT CAGGGACACT ATGGGCAGGC CAAGCCCCTC TATCAACGGG CCTTGGCTAT TTATGAAAAA ACCCTCGGTC CCGACCATCC CCGCACCGCC ACCAGCCTCA ATAACCTGGC GGCGCTGTAC GATACGCAGG GAGATTATGC GCGGATCGAG CCCCTCTATC AACGGGCCTT GGTTATTCAT GAAAAAACCT GCGGTCCCGA CCATCCCCGC ACCGCCACCA GCCTCAATAA TCTCGCGGGG CTGTATAAGG ATCAGGGAAG CTATGCCCAG GCCGAGCCCT TTTATCAACG GGCCTTGAGT ATTTGTGAAA AAAACCTCGG CCCCGACCAT CCCGATACCC ATACGGTGCG TCAAAATTAT CAAGCGCTGC TTGCCGCCAT GGGGCAGCAG AAATAG
|
Protein sequence | MPPTSRYGPH PNRPAANKQF TNRDGPIKAF QAARQALAPD QHDVLTFHGV GGQGKTALGK KFRQILEEET PRAALWGHVD LADTHLPTPD RILLELRRSL RRSGKIEFPA FDVAFAHYWQ QAYPHLKFQD THRDLLTDKE GVFAHALESA SSYAEALPAG IGLTMIALNW VRQRMRDFDN RYRPALRGLE TFSPTEILDR LPYYLGLDLC GYQRRQRAKQ LIVFLDTYDA LWETFTQGAR WEVDAWAREL VASAPGVLFV VFAREKLTWD THEPDAWRGC LHHQPPLAPL TDEDADTFLR QIPIEEEAVR QTIIQAAEGH PFYLDLEVEH YLDLVADGAT PQPEHFGQTK QAILARFLRY RAGEEQATLK VLSVVRSFDF ELLEALVRAF QTGFPPTEFF NFVRYSFVAP ESGERYRLHS LMQAHLLESL EPALQKKIHD FAFTYYDARC QPESPKDLQP AHEIALVEAF YHRDMTDPQE AAAWINQRTT VFYEAARYSL VEPLYQRALA IREQILGPDH PDTATSLNNL AELYRAQGRY AEAEPLYQRA LAICEQVLGP DHPDTARSLN NLAGLYKAQG DYGQAEPLYR RALAICEQAL GPDHPHTATS LNNLAGLYDS QGHYGQAKPL YQRALAIYEK TLGPDHPRTA TSLNNLAALY DTQGDYARIE PLYQRALVIH EKTCGPDHPR TATSLNNLAG LYKDQGSYAQ AEPFYQRALS ICEKNLGPDH PDTHTVRQNY QALLAAMGQQ K
|
| |