Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Noc_2494 |
Symbol | |
ID | 3704379 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Nitrosococcus oceani ATCC 19707 |
Kingdom | Bacteria |
Replicon accession | NC_007484 |
Strand | + |
Start bp | 2839658 |
End bp | 2841136 |
Gene Length | 1479 bp |
Protein Length | 492 aa |
Translation table | 11 |
GC content | 49% |
IMG OID | 637738973 |
Product | anthranilate synthase component I |
Protein accession | YP_344477 |
Protein GI | 77165952 |
COG category | [E] Amino acid transport and metabolism [H] Coenzyme transport and metabolism |
COG ID | [COG0147] Anthranilate/para-aminobenzoate synthases component I |
TIGRFAM ID | [TIGR00564] anthranilate synthase component I, non-proteobacterial lineages |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 0.184686 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGGAAGCCC AACAATTCAA ACAGCTTGCT GCTCAAGGTT ATAATCATAT TCCCCTCATG CGGGAGGTTT TAGCTGACCT AGATACTCCA CTTAGCACTT ACCTCAAACT GGCTAATGGC CCTTACTCCT ATTTATTAGA ATCCGTCCAT GGCGGCGAGA AATGGGGCCG ATATTCATTT ATCGGCTTAC CTTGCCGCAC GGTAGTTAAG GTGCAAAAAT ATGAAATAGT CGTTGAAACC GATCACCGCA TTGAGGAAAC TCATCACAGC GATGATCCCC TGGCTTGGAT AGAAACCTTT AAGCAACGCT TTAAAGTCCC GGCCATTGCC TCTTTGCCTC GCTTTACAGG CGGCCTCGTA GGATACTTTG GCTATGATAC GATCCGCTAT ATCGAACCTA AGCTTGCCCA TTGGAAGAAG CCCGATTCCT TGGAAACGCC GGATATACTA TTACTTGTTT CCAACGAAAT TGTTGTTTTT GACAACCTGA GTGGCAAGCT CTACTTCATT ATTCATTGCG GCCCCGAAGA CTATACAGAG GGCCTGCAAC GCTTGGATGC CCTAGAAGAT CGCCTACGGG CAAGCGTCCC TGCCCACCAT ACCGTTACTC CCTCCCGCCT AGTCCTAGAA GATGATTTTA TCTCTGGATT TACGGAGCAA GGCTTTAAGG GGGCGGTTGA TAAAACGCGT CAATACATTA CTGACGGTGA TGTGATGCAA GTAGTGCTCT CACAACGACT CTCGGTGCCT TTTTCTGCCT CTCCCTTAAA CCTATACCGG GCGCTACGTT GCCTCAACCC TTCTCCCTAC ATGTACTATT TGAATCTAGA GGATTTCCAT GTGGTGGGCT CATCTCCCGA GATCCTAGTA CGCTTAGAGG ATGGTGCCGT TACCGTACGC CCCATCGCTG GCACCCGGCA TCGTGGCAGA GGCGAAGAAG AAGACCGAGC GCTAGAGCAA GAATTACTCG CTGACCCTAA GGAATTAGCG GAACACCGGA TGCTTATCGA TCTTGGCCGA AATGATGTTG GCCGGATAGC CACCATTGGC AGTGTCAACG TAACCGAAAA AATGCTTATC GAACGCTATT CTCATGTCAT GCATATCGTC TCTAATGTAA CCGGTCAACT TAAACCGGAG TTCTCGGCCA TGGACGTTTT ACGTGCAACT TTTCCTGCAG GTACGGTTTC TGGCGCTCCC AAAATTCGAG CGATGGAAAT CATTGATGAA TTAGAACCTA TTCAGCGGGG AGTCTATGCG GGTGCCGTGG GCTATCTTGC TTGGTCAGGC AATATGGATA CAGCAATCGC TATTCGTACC GCAATCATTA AAGATCAGAT TCTTCATATT CAAGCAGGTG CGGGAATCGT CTATGACTCG GTACCGCAGA GCGAGTGGGA AGAAACCCTG AATAAAGGGC GAGCTATCTT TCGAGCAGTG GCTTTGGCCG AAGCTGGCCT AGATGGTTCC TTCCGCTAG
|
Protein sequence | MEAQQFKQLA AQGYNHIPLM REVLADLDTP LSTYLKLANG PYSYLLESVH GGEKWGRYSF IGLPCRTVVK VQKYEIVVET DHRIEETHHS DDPLAWIETF KQRFKVPAIA SLPRFTGGLV GYFGYDTIRY IEPKLAHWKK PDSLETPDIL LLVSNEIVVF DNLSGKLYFI IHCGPEDYTE GLQRLDALED RLRASVPAHH TVTPSRLVLE DDFISGFTEQ GFKGAVDKTR QYITDGDVMQ VVLSQRLSVP FSASPLNLYR ALRCLNPSPY MYYLNLEDFH VVGSSPEILV RLEDGAVTVR PIAGTRHRGR GEEEDRALEQ ELLADPKELA EHRMLIDLGR NDVGRIATIG SVNVTEKMLI ERYSHVMHIV SNVTGQLKPE FSAMDVLRAT FPAGTVSGAP KIRAMEIIDE LEPIQRGVYA GAVGYLAWSG NMDTAIAIRT AIIKDQILHI QAGAGIVYDS VPQSEWEETL NKGRAIFRAV ALAEAGLDGS FR
|
| |