Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_0648 |
Symbol | |
ID | 3706880 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 694894 |
End bp | 697620 |
Gene Length | 2727 bp |
Protein Length | 908 aa |
Translation table | 11 |
GC content | 56% |
IMG OID | 637737156 |
Product | hypothetical protein |
Protein accession | YP_342697 |
Protein GI | 77164172 |
COG category | [U] Intracellular trafficking, secretion, and vesicular transport |
COG ID | [COG3451] Type IV secretory pathway, VirB4 components |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.685695 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGACGAATT CAAAATTCCC CGATACTTCG GACGCTTCGC CCTTAACTTC CGATTCGATA AAACAGCACT ATCATCGCCC TCCGTCGTTT ACCGACCTGT TGCCCTGGAT GGAATACCTC CCCGAGAGCA AAACCTTTCT GTTGGAGGAC GGCGTCAGTC TGGGAGCCTT GTTCGAGGTG CTGCCGGTCG GTTGCGAAGC GCGCACGCCA GCGTTTATGA CACAGTTGCG CGATGCTATT CAAATCGCTA TCAACGAGGC CATTCCCGAG CGCGATGATG CGCCCTGGGT GTTACAGATC TATGTGCAAG ATGAGCCCAA GCTAACCCAA TTGAATGCGT TGCTGGCCAG TTATCCGTCG CCGACGGTTC GTGCATCGGA TTATTCCCAG CATTACCAGT CGGTGATGTC GCAACACTTG CAGGCGATTT CTCGACCGGG TGGGTTGTTT GACGATACCA CCGTCACCGG TTCGCGTTGG CAGGGGCAGC AACGTCGGAT CCGTGTAGTG CTCTACCGTC GTTTGATAAG TAATGGCAAA TCCCCAGCCG CTATAGAGGT GGAGGAGGCG ATCAACGATG TGGCGACCAA ATGGATCGCC TCATTGGCCT CTGCCGGGAT AAAGGTCCAG CGTGCCAATG GTCAGGCGTT TTATCAGTGG TTGCTGAGCT GGTTCAATCC CAATCCACCG TTAGCAGACG GTAATCCGGA AGCACTTCTG CAGCTGGCGC CATATCCGGG CGATGAAAAC TTACCGTTCG GCCATGACTT TGCTGAGCAA CTGACCCTCT CGATGCCACG CTCCGACGAG GCAACAGCGA GTTGGGTTTT CGATAATATG CCGCACACTG TAGTGACCAT CCAAAGCTTA CGGCGGGCAC CGGACATTGG GCACTTCACG GCGGAGCGCC AGGCCGGGGA TCAGGTGTTT TCGCTATTTG ATCGTCTGCC CGAGCACACT ATCATGGCGA TCACACTCAC TCTCAAGCCG CAGGACACCA CCCGTAATCA CATCGCCCAG ATCAAACGCG CATCGGTGGG TGATTCCGCT GAAGCGTCGA TTACTCGGGA AGATGCGGAA CACGTGGAGC GGGAGATGGC ACAGGGCAAC AAGCTGTATC CCGTGAGCAT GGCCTTCTAT GTGCGGGGCG ATAATCAAAA GTCGCTGCGT AACAATTTGA ATCGACTCCA TGCGCTGCTC TTACCCAATG GCCTGCAGCC CATTGCCCAG GAAGCGGACC TACTGTCGCT CGACAGCTAT ATCCGCAACT TGCCCATGGC CTATGACGCA GACCTGGACA AATCCCGTCG TCGCTCGCGG CTGATCTTCT CTCGCCACAT TGCCAATTTA CTGCCGTTCT ACGGTCGTTC CCGTGGGACC GGTCATCCGG GCCTGGTATT CTACAACCGA GGTGCCGAAC CTTTGGTCTT CGACCCGCTG CATCGGGATG ACCGTAAGAA AAATGCCCAC ATGCTGATCC TGGGGCCGAC GGGTGCGGGG AAGTCGGCGC TACTGGTGTA TCTGCTGCAG CAGATGATGG CACGGCACCG GCCACGTATT TTTATCATCG AGGCCGGTGC TTCGTTTTCG CTGTTGGGAC AGCATTTTGC GCACCACGGT CTGTCAGTAA ACCAGATAAC CCTGAACCCC AATGTCGATG TCAGCCTGCC GCCCTTTACC GATGCGCTTC GACTGCTGGA TCGGCGCCAT GCGTTTAATC CGCTTAGTGT CGATGAATCC ACAATGGAGG AGACGTTAGA CGAAGATGAT GAGATCGAGG AGGAAGGGGG CGGTCGCGAT ATTCTCGGTG AGATGGAGAT TGCTGCACGC ATCATGATTA CCGGTGGTGA TGAACGCGAA GACGCGCGAT TGACCCGGGC CGATCGACTG CTGATCCGCA ATGCCATATT CCTTGCCGCA AAAACCGTGA AAGAGACCAG CCGAACCCAG GTAATTACCC AGGATGTGGT CAACGCCTTT CAGACCATCG CCACCAACGC GGAGTTACCC GAACACCGGC GTAATCGTGC ACTCGAGATG GGCGATGGTA TGGCGCTGTT CTGCTCGGGC CTGGCTGGCC ACTTCTTCAA CCGACCGGGA CAGTCCTGGC CTGCGGTCGA TGTCACTATC CTCGAAATGG GCATGCTGGC AAGAGAGGGC TACGAGGATC AGCTCACTGT CGCCTACCTG TCGATGATGA GTCACATCAA CGACCTGGTG GAACGCCACC AGCACGACGA TCGGCCCACA CTGGTGGTCA CGGACGAAGG CCACATTATT ACCACCAATC CCTTGCTGGC CCGTTACGTG GTCAAGATCA CCAAGATGTG GCGCAAGCTC GGTGCCTGGT TTTGGATCGC CACCCAGAAT CTTGAAGATT TCCCGGACGC CAGCCGCAAG ATGCTCAACA TGATGGAGTG GTGGCTGTGC CTGGTCATGC CGAAAGAGGA GGTGGAACAA ATCGCCCGCT TCAAGGACCT GAATGACGAA CAACGTAACC TGCTGCTGTC TGCCCGCAAG GAGCCGGGAA AATACGTGGA GGGTGTTGTG CTGGCCGACA AAGTCGAAGC CCTGTTTCGC AATGTTCCAC CAGCTTTGTC ACTCGCCCTG GCGATGACCG AAAAGCACGA GAAAGCAGAG CGGGCTGCCA TTATGCGTGA GAAAAACTGC TCGGAGCTGG AGGCCGTTTA TGAAATCGCC CAGCGTATCG AACAGACTCG CGCGTAG
|
Protein sequence | MTNSKFPDTS DASPLTSDSI KQHYHRPPSF TDLLPWMEYL PESKTFLLED GVSLGALFEV LPVGCEARTP AFMTQLRDAI QIAINEAIPE RDDAPWVLQI YVQDEPKLTQ LNALLASYPS PTVRASDYSQ HYQSVMSQHL QAISRPGGLF DDTTVTGSRW QGQQRRIRVV LYRRLISNGK SPAAIEVEEA INDVATKWIA SLASAGIKVQ RANGQAFYQW LLSWFNPNPP LADGNPEALL QLAPYPGDEN LPFGHDFAEQ LTLSMPRSDE ATASWVFDNM PHTVVTIQSL RRAPDIGHFT AERQAGDQVF SLFDRLPEHT IMAITLTLKP QDTTRNHIAQ IKRASVGDSA EASITREDAE HVEREMAQGN KLYPVSMAFY VRGDNQKSLR NNLNRLHALL LPNGLQPIAQ EADLLSLDSY IRNLPMAYDA DLDKSRRRSR LIFSRHIANL LPFYGRSRGT GHPGLVFYNR GAEPLVFDPL HRDDRKKNAH MLILGPTGAG KSALLVYLLQ QMMARHRPRI FIIEAGASFS LLGQHFAHHG LSVNQITLNP NVDVSLPPFT DALRLLDRRH AFNPLSVDES TMEETLDEDD EIEEEGGGRD ILGEMEIAAR IMITGGDERE DARLTRADRL LIRNAIFLAA KTVKETSRTQ VITQDVVNAF QTIATNAELP EHRRNRALEM GDGMALFCSG LAGHFFNRPG QSWPAVDVTI LEMGMLAREG YEDQLTVAYL SMMSHINDLV ERHQHDDRPT LVVTDEGHII TTNPLLARYV VKITKMWRKL GAWFWIATQN LEDFPDASRK MLNMMEWWLC LVMPKEEVEQ IARFKDLNDE QRNLLLSARK EPGKYVEGVV LADKVEALFR NVPPALSLAL AMTEKHEKAE RAAIMREKNC SELEAVYEIA QRIEQTRA
|
| |