Gene Sterm_3300 details


Gene Information

Locus tag: Sterm_3300
Symbol:
ID: 8598752
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sebaldella termitidis ATCC 33386
Kingdom: Bacteria
Replicon accession: NC_013517
Strand:
Start bp: 3473944
End bp: 3475203
Gene Length: 1260 bp
Protein Length: 419 aa
Translation table: 11
GC content: 36%
IMG OID:
Product: Anthranilate synthase
Protein accession: YP_003310071
Protein GI: 269121894
COG category:
COG ID:
TIGRFAM ID:
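
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only. It assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length; the record itself states neither convention.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 3473944
end_bp = 3475203
gene_length_bp = 1260
protein_length_aa = 419

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not part of the protein.
codons = span // 3
expected_protein_length = codons - 1

print(span, expected_protein_length)  # prints "1260 419"
```

Under these assumptions both derived values agree with the Gene Length and Protein Length fields.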


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000317164
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCATAA AACCAAAGTG TGAAGCTGCT GACTGCTTCA AATATATAAA GAAAGTTTTT 
CCTAACAGCT ATCTTGCCGA AGATAACAGA CAGATAATAA TAGGAATTGA TGTTGTCTAT
TTTGATTCTG ACACATATAC CTATTCAGAT CTGCGTGAAT TGGTAAAATC AAGAAAAAGC
ATTGCAGAAT TTTCCGGATT GTTCGGAGTA TTCTCATACG AGACTATCCA TTATTTTGAA
AAGATAAACA GAAAAGAAAA AGCGGAATAT GACTATCCTG AATTTATATT TTCCGATGCC
GGAGCCTATC TTCATTTTGA CAAAAAAAAT TCTGAATTCA GCTTTTTTGG AGATACAGAA
AAATATTCAG ATTTACTTTT ACAACTGAAA GATGTGGCGT CTGCCGAAAA GTCCGAAAAT
AATAAAAGCT TCAAAATTCT CAGTAATGAA TCCGAGAAAA AAAATATTTT TCTGAAGAAT
GTTAATGAAG CCAAGGAATA TATAAAAAAA GGCGATATTT TCCAAATTGT GCTGAGTTCT
CAGATAATAA TCGAATCAGA CTATGATCCT TATGATTTCT ATATGGAACT TACAGAAAAA
AATCCTTCGC CGTATATGTT TTATTTCCCC ACACCATACG GAACAGTCAT AGGCTCAAGT
CCGGAAATAC TTTTGAAAAT AGAGGACAAA CAGATTTTCA TAGCTCCCAT TGCAGGAACA
AGACCCAGAG GCCGTGACCC GGAGGAAGAC AAAATATTAG CCCGTGAACT TTTAAATGAT
GATAAAGAGC TGGCAGAACA CAGAATGCTT ATAGATCTTG CACGAAATGA CATAGGAAAG
TTCAGCAGCC CCGGCAGTGT GAAAGTAAAG AATCCTATGC ATGTAGAATA CTTCCAGCAT
GTTATGCACA TTGTAAGCGA CGTTTACGGA GAGCTTGCTG ACGGAACAGA TATTTTTGAC
GTTATTTCTA CTGCTTTTCC AGCGGGAACC TTGAGCGGAG CCCCGAAAAT CAGAGCAATG
GAAATAATTG CAGAGCTTGA ACTCCATAAG AGAAATATCT ACGGCGGCGG AATAGGCTTT
CTTCATTATA ACGGGAATTC ACAGATCGCC ATTATTATAA GAACAGCATT TTATAAAGAT
AAAAAATACT ATATACAGTC AGGAGCCGGA ATCGTATATG ATTCAGATCC GGAAAAGGAA
TATCTTGAAA TACTGCATAA AAGAAAGTCG CTCACAGGTA TACTGACTGA TTGTAAATAA
 
Protein sequence
MIIKPKCEAA DCFKYIKKVF PNSYLAEDNR QIIIGIDVVY FDSDTYTYSD LRELVKSRKS 
IAEFSGLFGV FSYETIHYFE KINRKEKAEY DYPEFIFSDA GAYLHFDKKN SEFSFFGDTE
KYSDLLLQLK DVASAEKSEN NKSFKILSNE SEKKNIFLKN VNEAKEYIKK GDIFQIVLSS
QIIIESDYDP YDFYMELTEK NPSPYMFYFP TPYGTVIGSS PEILLKIEDK QIFIAPIAGT
RPRGRDPEED KILARELLND DKELAEHRML IDLARNDIGK FSSPGSVKVK NPMHVEYFQH
VMHIVSDVYG ELADGTDIFD VISTAFPAGT LSGAPKIRAM EIIAELELHK RNIYGGGIGF
LHYNGNSQIA IIIRTAFYKD KKYYIQSGAG IVYDSDPEKE YLEILHKRKS LTGILTDCK
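
The protein sequence can be reproduced from the gene sequence by translating it with the codon assignments of translation table 11. The following is a minimal sketch, not the database's own pipeline: the helper names `translate` and `gc_fraction` are hypothetical, the codon string follows the standard NCBI layout (bases in T, C, A, G order), and the spaces that group the sequence into blocks of ten are assumed to be formatting only.

```python
from itertools import product

# Codon table in NCBI order (first base varies slowest, third base fastest).
# Table 11 uses the same codon-to-amino-acid assignments as the standard code.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}


def translate(dna: str) -> str:
    """Translate a DNA string in frame 0, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spacing between 10-base blocks
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases, ignoring whitespace."""
    dna = "".join(dna.split()).upper()
    if not dna:
        return 0.0
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence from this record.
first_line = "ATGATCATAA AACCAAAGTG TGAAGCTGCT GACTGCTTCA AATATATAAA GAAAGTTTTT"
print(translate(first_line))  # prints "MIIKPKCEAADCFKYIKKVF"
```

The output matches the first twenty residues of the protein sequence above. Passing the full gene sequence to `gc_fraction` gives a value that can be compared with the GC content field.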