Gene Information |
Locus tag | Noc_2938 |
Symbol | |
ID | 3706420 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | - |
Start bp | 3326815 |
End bp | 3327924 |
Gene Length | 1110 bp |
Protein Length | 369 aa |
Translation table | 11 |
GC content | 57% |
IMG OID | 637739415 |
Product | chorismate synthase |
Protein accession | YP_344913 |
Protein GI | 77166388 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0082] Chorismate synthase |
TIGRFAM ID | [TIGR00033] chorismate synthase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 25 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGTCTGGCA ATACTCTTGG CAAACTTTTT ACTGTTACCA CCTTTGGCGA AAGCCACGGG CCGGCGTTGG GCTGTATTGT AGATGGCTGC CCCCCGGGCT TGGCTTTATG CGAGACGGAT ATTCAAATTG ACCTGGATCG GCGCCGGCCC GGTAAATCCC GTCATACTAC CCAGCGCCGG GAACCGGATC AGGTCCAGAT CCTCTCCGGG GTGTTCGAGG GAAAAACCAC CGGCACTCCC ATCGGTTTGT TAATTGAAAA TGTCGATCAA CGCTCCCGGG ATTACGATAA AATCAAGGAG CAAATCCGGC CCGGCCATGC AGACTATACT TATTTGCAAA AATATGGCCT GCGGGATTAC CGGGGAGGGG GGCGTTCCTC GGCCCGGGAG ACGGCCATGC GGGTAGCTGC GGGCGCTATC GCCAAGAAAT ACCTGGCAGA GCGGCATGGC GTAAAAATTC GGGGGTATCT GGCTCAGCTT GGCCCCATTC GGGCTGAACG ATTCGACTGG GAAATCGTGG AGAAAAATCC CTTTTTCTGC CCGGACCCGG ATAAAATATC TGAACTTGAA GCCTATATGG ACGCTCTCCG GAAAGAAGGC GATTCCATTG GCGCCCGGAT TAACGTGGTG GCTACCGGAG TCCCTCCCGG TCTGGGCGAG CCGGTCTTTG ATCGCCTCGA TGCGGATTTG GCCCATGCCC TTATGAGCAT TAACGCCGTT AAGGGCGTGG AAATCGGTGT TGGTTTCGCC GCAGTGACCC AAAAGGGTAC TGACCATCGC GACCCCCTTA CCCCGGAAGG TTTCCTCAGT AACCATGCGG GCGGTGTCTT GGGGGGGATT TCCACGGGGC AGGATATTCT TGCTAGCATT GCGCTAAAAC CCACTTCCAG CCTCCGCTTA CCCGAGCGTA CCATTAACTG CCGGGGCGAG TCTGCGGAAG TCGTTACCAC GGGCCGCCAT GATCCTTGTG TTGGCATTCG GGCAACGCCT ATTGCCGAAG CCATGGCTGC CTTGGTGTTG ATGGACCATC TGCTGCGCCA CCGCGCCCAA AATATGGACG TTCAGCCGAG TTTGCCATCC ATCCCCGCTT ATCCCGGTGG TGGCGGCTAA
|
Protein sequence | MSGNTLGKLF TVTTFGESHG PALGCIVDGC PPGLALCETD IQIDLDRRRP GKSRHTTQRR EPDQVQILSG VFEGKTTGTP IGLLIENVDQ RSRDYDKIKE QIRPGHADYT YLQKYGLRDY RGGGRSSARE TAMRVAAGAI AKKYLAERHG VKIRGYLAQL GPIRAERFDW EIVEKNPFFC PDPDKISELE AYMDALRKEG DSIGARINVV ATGVPPGLGE PVFDRLDADL AHALMSINAV KGVEIGVGFA AVTQKGTDHR DPLTPEGFLS NHAGGVLGGI STGQDILASI ALKPTSSLRL PERTINCRGE SAEVVTTGRH DPCVGIRATP IAEAMAALVL MDHLLRHRAQ NMDVQPSLPS IPAYPGGGG
|
| |
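The coordinate and length fields in this record can be cross-checked with a few lines of Python. The sketch below is illustrative only and is not part of the database record: the helper names `gene_length` and `translate` are hypothetical, the coordinates are assumed to be 1-based and inclusive, and the gene sequence is assumed to be listed in coding orientation (it begins with ATG and ends with TAA even though the gene is on the minus strand). The codon table holds the standard amino-acid assignments, which translation table 11 shares with table 1; the two tables differ only in the permitted start codons.

```python
# Standard amino-acid assignments, with codons ordered T, C, A, G at each position.
# Translation table 11 uses these same assignments; it differs from table 1
# only in which codons may serve as start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def gene_length(start_bp, end_bp):
    """Feature length, assuming 1-based inclusive coordinates (hypothetical helper)."""
    return end_bp - start_bp + 1


def translate(dna):
    """Translate coding-orientation DNA, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()  # drop the 10-base spacing used in the record
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]  # raises KeyError on ambiguous bases
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


length = gene_length(3326815, 3327924)
print(length)           # 1110, matching the Gene Length field
print(length // 3 - 1)  # 369 codons excluding the stop, matching Protein Length
print(translate("ATGTCTGGCA ATACTCTTGG CAAACTTTTT"))  # MSGNTLGKLF
```

Passing the full gene sequence from the record to `translate` in the same way would allow a comparison against the listed protein sequence.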