Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Sterm_3300 |
Symbol | |
ID | 8598752 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Sebaldella termitidis ATCC 33386 |
Kingdom | Bacteria |
Replicon accession | NC_013517 |
Strand | + |
Start bp | 3473944 |
End bp | 3475203 |
Gene Length | 1260 bp |
Protein Length | 419 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | |
Product | Anthranilate synthase |
Protein accession | YP_003310071 |
Protein GI | 269121894 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 4 |
Plasmid unclonability p-value | 0.0000317164 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | n/a |
Fosmid unclonability p-value | n/a |
Fosmid Hitchhiker | n/a |
Fosmid clonability | n/a |
| |
Sequence |
Gene sequence | ATGATCATAA AACCAAAGTG TGAAGCTGCT GACTGCTTCA AATATATAAA GAAAGTTTTT CCTAACAGCT ATCTTGCCGA AGATAACAGA CAGATAATAA TAGGAATTGA TGTTGTCTAT TTTGATTCTG ACACATATAC CTATTCAGAT CTGCGTGAAT TGGTAAAATC AAGAAAAAGC ATTGCAGAAT TTTCCGGATT GTTCGGAGTA TTCTCATACG AGACTATCCA TTATTTTGAA AAGATAAACA GAAAAGAAAA AGCGGAATAT GACTATCCTG AATTTATATT TTCCGATGCC GGAGCCTATC TTCATTTTGA CAAAAAAAAT TCTGAATTCA GCTTTTTTGG AGATACAGAA AAATATTCAG ATTTACTTTT ACAACTGAAA GATGTGGCGT CTGCCGAAAA GTCCGAAAAT AATAAAAGCT TCAAAATTCT CAGTAATGAA TCCGAGAAAA AAAATATTTT TCTGAAGAAT GTTAATGAAG CCAAGGAATA TATAAAAAAA GGCGATATTT TCCAAATTGT GCTGAGTTCT CAGATAATAA TCGAATCAGA CTATGATCCT TATGATTTCT ATATGGAACT TACAGAAAAA AATCCTTCGC CGTATATGTT TTATTTCCCC ACACCATACG GAACAGTCAT AGGCTCAAGT CCGGAAATAC TTTTGAAAAT AGAGGACAAA CAGATTTTCA TAGCTCCCAT TGCAGGAACA AGACCCAGAG GCCGTGACCC GGAGGAAGAC AAAATATTAG CCCGTGAACT TTTAAATGAT GATAAAGAGC TGGCAGAACA CAGAATGCTT ATAGATCTTG CACGAAATGA CATAGGAAAG TTCAGCAGCC CCGGCAGTGT GAAAGTAAAG AATCCTATGC ATGTAGAATA CTTCCAGCAT GTTATGCACA TTGTAAGCGA CGTTTACGGA GAGCTTGCTG ACGGAACAGA TATTTTTGAC GTTATTTCTA CTGCTTTTCC AGCGGGAACC TTGAGCGGAG CCCCGAAAAT CAGAGCAATG GAAATAATTG CAGAGCTTGA ACTCCATAAG AGAAATATCT ACGGCGGCGG AATAGGCTTT CTTCATTATA ACGGGAATTC ACAGATCGCC ATTATTATAA GAACAGCATT TTATAAAGAT AAAAAATACT ATATACAGTC AGGAGCCGGA ATCGTATATG ATTCAGATCC GGAAAAGGAA TATCTTGAAA TACTGCATAA AAGAAAGTCG CTCACAGGTA TACTGACTGA TTGTAAATAA
|
Protein sequence | MIIKPKCEAA DCFKYIKKVF PNSYLAEDNR QIIIGIDVVY FDSDTYTYSD LRELVKSRKS IAEFSGLFGV FSYETIHYFE KINRKEKAEY DYPEFIFSDA GAYLHFDKKN SEFSFFGDTE KYSDLLLQLK DVASAEKSEN NKSFKILSNE SEKKNIFLKN VNEAKEYIKK GDIFQIVLSS QIIIESDYDP YDFYMELTEK NPSPYMFYFP TPYGTVIGSS PEILLKIEDK QIFIAPIAGT RPRGRDPEED KILARELLND DKELAEHRML IDLARNDIGK FSSPGSVKVK NPMHVEYFQH VMHIVSDVYG ELADGTDIFD VISTAFPAGT LSGAPKIRAM EIIAELELHK RNIYGGGIGF LHYNGNSQIA IIIRTAFYKD KKYYIQSGAG IVYDSDPEKE YLEILHKRKS LTGILTDCK
|
| |