Gene Information |
Locus tag | Noc_0468 |
Symbol | |
ID | 3706639 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 503143 |
End bp | 504471 |
Gene Length | 1329 bp |
Protein Length | 442 aa |
Translation table | 11 |
GC content | 54% |
IMG OID | 637736977 |
Product | hypothetical protein |
Protein accession | YP_342521 |
Protein GI | 77163996 |
COG category | [S] Function unknown |
COG ID | [COG4325] Predicted membrane protein |
TIGRFAM ID | |
|
|
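The coordinate and length fields in the gene table can be cross-checked against each other. The sketch below assumes the Start bp and End bp values are 1-based and inclusive, and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
# Values copied from the gene record above.
START_BP = 503143
END_BP = 504471


def gene_length(start: int, end: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residues encoded by a complete CDS: one per codon, minus the stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, "bp;", length_aa, "aa")
```

Under these assumptions the computed values agree with the Gene Length (1329 bp) and Protein Length (442 aa) fields.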
Plasmid Coverage information |
Num covering plasmid clones | 35 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGAATAAAC TGACTTATCT ATGGGAAGCT CTACGCGAAA GCTTGTGGTT CATCCCGATG TGTATTGTCA TCGGTGCAGT CCTCCTTGCA TTGGGGCTGA TCGAGCTAGA GGCGGCTGTA GAACGCGAGC AGCTGGCAGA GCATTGGCCC ACACTATTCG GAGTTGGAGC GGATGGCTCG CGCGCGCTGC TTTCAGCGAT CGCCAGCTCC ATGATCACCG TTGCTGGCGT TACCTTTTCG ATCACGGTCG TCGCTCTGTC TTTGGCCTCG AGTCAGTATA CGTCACGCAT CTTACGAAAT TTTATGCGGA ATCGGAGCAA CCAAGCAGTC CTCGGCGTCT TTGTGGGCGT CTTTGCTTAT TGTCTCGTGG TGCTGCGGAC GATTCGCGGT GGCGATCAAG GGGTATTTGT CCCGGGATTA GCGGTACTCG GTGCCCTACT GCTGGCATTC GTAGCCATCA GCTTTCTCAT CTTTTTCATC CATCATATTG CCACCTCAAT TCAAGCTACA AGTATTATTA AGTCGGCAGC CACAGAGACT CTTGAAGCCA TTGACCGTTT GTTCCCCACG GAGATAGGTG AGGCCACCGC CGAGCATGTG GGTAATGCTC CAGGAGCAAG AGTGGACTTG GCGGCCCAGG CGTGGATAAC TATCCCCGCT CAACAATCTG GCTATATTCA AGGGATCGAC GCCGATGCCC TGCTTCACGT CGCCTGCGCT CAGGACATCA TCGTGCGGAT GGAAAAGGAA ATTGGCGAGT TCGTCATCGA AGCTTCCCCC CTTGTGTCCG TAACTGGCAA GTCGCCAGGT GATGAGACCA TTTGGGAGCT GAACGCAGCC TACACGATAG ACTGGCGCCG CGCGGTGGAG CAGGACGCGA CCTACGGTAT CCAACAGATC GTGGATGTGG CGCTCAAGGC GCTGTCACCC GGCATCAACG ACACCACCAC CGCAATTATT TGTGTTGACT TCCTAGGGAT GATTCTTGCA CGTCTCATCG CCCGGCACAT TGAAACACCC TACCACTCTG ATAACGGGCA GCTACGTCTC ATTACTCGCG GTCCCACTTT TTCTAATTTA CTATCCCAAG CCTTTGATCA AATTCGCCGG AACGCCGAAG GTAATGTTGC CGTCCTCATT CGGCTTCTTC AGAGTCTGGA AACGCTTACC AAGTTAACGG TGAACGTACA ACGGCATCAG GCGCTCCGTC AACAAGCCGC TCTCATCATT GAAACAGCCG AGCGTACTGT CCCCATGGCC TATGATCGCA TGCCCATTCA AGCAATCCGA GATCGCATCT TTCCACTGCC GGCCGATGAA AGCCGGTAA
|
Protein sequence | MNKLTYLWEA LRESLWFIPM CIVIGAVLLA LGLIELEAAV EREQLAEHWP TLFGVGADGS RALLSAIASS MITVAGVTFS ITVVALSLAS SQYTSRILRN FMRNRSNQAV LGVFVGVFAY CLVVLRTIRG GDQGVFVPGL AVLGALLLAF VAISFLIFFI HHIATSIQAT SIIKSAATET LEAIDRLFPT EIGEATAEHV GNAPGARVDL AAQAWITIPA QQSGYIQGID ADALLHVACA QDIIVRMEKE IGEFVIEASP LVSVTGKSPG DETIWELNAA YTIDWRRAVE QDATYGIQQI VDVALKALSP GINDTTTAII CVDFLGMILA RLIARHIETP YHSDNGQLRL ITRGPTFSNL LSQAFDQIRR NAEGNVAVLI RLLQSLETLT KLTVNVQRHQ ALRQQAALII ETAERTVPMA YDRMPIQAIR DRIFPLPADE SR
|
| |
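The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It builds the codon table by hand rather than relying on a library; translation table 11 uses the same codon-to-amino-acid assignments as the standard code and differs in which codons may act as start codons, which does not matter here because the gene starts with ATG. The helper names (`clean`, `translate`, `gc_percent`) are hypothetical, and only the first 30 bases of the gene sequence are used to keep the example short.

```python
from itertools import product

# Codon table in the conventional TCAG ordering. Table 11 shares these
# amino-acid assignments with the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces between 10-base blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate codon by codon from the first base, stopping at a stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First three 10-base blocks of the gene sequence above.
fragment = "ATGAATAAAC TGACTTATCT ATGGGAAGCT"
print(translate(fragment))
print(round(gc_percent(fragment), 1))
```

Passing the full gene sequence to `translate` and `gc_percent` in the same way gives values that can be compared with the Protein sequence and GC content fields of the record.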