Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_2721 |
Symbol | |
ID | 3704747 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 3090347 |
End bp | 3091588 |
Gene Length | 1242 bp |
Protein Length | 413 aa |
Translation table | 11 |
GC content | 58% |
IMG OID | 637739203 |
Product | hypothetical protein |
Protein accession | YP_344704 |
Protein GI | 77166179 |
COG category | [S] Function unknown |
COG ID | [COG2995] Uncharacterized paraquat-inducible protein A |
TIGRFAM ID | [TIGR00155] integral membrane protein, PqiA family |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 34 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGACTTCT CTGCACAGGC CACGTTAGCC GTCTGCCATC ACTGCGACTG GGTGATGACC TTGCCACGAC TACGGGCGGG CGAGACCGCT TGCTGCCCCC GATGCGAACA CAAGTTACCT GGCCAACAGC ACACTTCAAT TCAAAGCCAA CTTGCCTGGG CCAGCGCGGC ATTAATCATG TTGGCAGCCG CGATCGCCTT TCCCTTCGTG AGCTTCGAGG TACAGGGGAT TAAACACACC ATTATCGTGG CAGATACCGC ACTAGCCCTC TTTGACTACG ATTTTCCATT CTTGGGACTG GTTGTGCTTA CGACCACGAT TTTGTTGCCA ACCGCCTATC TACTGGTCCT CCTTTATCTC CATGGAGTGC TGGCTTCGGG CCGCCGTCCC ATGGGGGCGC AAACACTAGC GCGGCTACTG ACAAGCATCA AACCTTGGGT GATGAGCGAT GTGTTTGTGG TCGGGGTGCT AGTGAGCATG ATCAAGGTGC TCTCCCTGGC GAGTCTGCAG CTCGGTCCTG CATTCCCGGC GTTTTGCGCC TATGCCGTGT TGCTGTTGAA GTCCATCTCT AGCTTCGATC CTGGAACCTT GTGGACGGCT ATCAGTGGCC CGGTCGATCC TCCGGTTGAT CTTGCCCCCG GCAGTCCTGC GGCTACTCAA GGAGCCGCAG GCTGCACCCG CTGCAATGCC ATCGTCAATA CTGCCAGCCA GACACGCTGC CCACGTTGCG GCTACCATCC TATCGCGCCC AACCCGCGCC GCTTGCAAGC AACCTGGGCA TTGCTCATCG CAGCAGGCAT CTTGTATATT CCAGCCATGG CCTATCCCAT CATGATCACC ACCGAGCTCG GTAGAACCTC ACCACAGACC GTAGTGGGCG GCGCCCGGCT TTTGCTGGAG ACCGGTTCCT GGCCCATCGC CCTGATAATC TTTACGGCGA GCATTGTGGT GCCTATCGGT AAGGTGCTCG CTCTTGGCTG GCTTTGTTTG CAGGCACAAG CGGGTACCGG GCGCAGCGCC TATGACCGAC TCAGGCTGTA CCGGCTTGTC GAGGCAATCG GCCGCTGGTC GTTCCTCGAC GTATTTGTAG TTGCCCTGTT GACCGCCCTC ATTCAGGCAG GTGAGCTTAT GCGCGTACAG CCCAGCCCCG GTGTGGTTAT CTTTGCAATT GTGGTGATCC TCACCATGCT GGCGGCAATG GCCTTTGACC CCCGCCTGAT TTGGCGGGTC CATGAAAAAT GA
|
Protein sequence | MDFSAQATLA VCHHCDWVMT LPRLRAGETA CCPRCEHKLP GQQHTSIQSQ LAWASAALIM LAAAIAFPFV SFEVQGIKHT IIVADTALAL FDYDFPFLGL VVLTTTILLP TAYLLVLLYL HGVLASGRRP MGAQTLARLL TSIKPWVMSD VFVVGVLVSM IKVLSLASLQ LGPAFPAFCA YAVLLLKSIS SFDPGTLWTA ISGPVDPPVD LAPGSPAATQ GAAGCTRCNA IVNTASQTRC PRCGYHPIAP NPRRLQATWA LLIAAGILYI PAMAYPIMIT TELGRTSPQT VVGGARLLLE TGSWPIALII FTASIVVPIG KVLALGWLCL QAQAGTGRSA YDRLRLYRLV EAIGRWSFLD VFVVALLTAL IQAGELMRVQ PSPGVVIFAI VVILTMLAAM AFDPRLIWRV HEK
|
| |