Gene Sterm_3467 details


Gene Information

Locus tag: Sterm_3467
Symbol:
ID: 8598918
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sebaldella termitidis ATCC 33386
Kingdom: Bacteria
Replicon accession: NC_013517
Strand:
Start bp: 3672087
End bp: 3673514
Gene Length: 1428 bp
Protein Length: 475 aa
Translation table: 11
GC content: 35%
IMG OID:
Product: glycoside hydrolase family 1
Protein accession: YP_003310237
Protein GI: 269122060
COG category:
COG ID:
TIGRFAM ID:
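The start, end, gene length, and protein length fields in this record are related by simple arithmetic. The sketch below checks that relationship. It assumes the coordinates are 1-based and inclusive, and that the gene length counts one terminal stop codon (the gene sequence in this record does end in TAA). The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3672087
END_BP = 3673514


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 1428
print(protein_length(length_bp))  # 475
```

Both results agree with the Gene Length (1428 bp) and Protein Length (475 aa) fields.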


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.0716612
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGTATTGA AGAAGTCTTT TTTATGGGGT GGAAGTATAG CGGCCCATCA ATGCGAGGGA 
GCGTGGGATA AAGACGGCAA AGGTGTGGGA ATTATGGATC TGGTAACACA GGGTACTTAT
GAAAAACCAA GAGAAATAAC TAAAACAATC GAGGAAGGTA AAATATATCC TTCTCATGAA
GGAATTGATT TTTATCATAA TTATAAAGAA GATATTGCTT TATTTGCCGA AATGGGTTTT
AAAGCACTGC GTATTTCTGT TGACTGGTCC CGTATATATC CAAAGGGTGA TGAAGAAAAA
GCAAATGCAC TGGGAATTCA GTTTTATCAA AATATTGTGG ATGAATTATT GAAGCATAAA
ATAGAGCCTG TAGTTACATT ATATCACTTT GAAATGCCTG TTCATTTAGT CACTGAATAT
GGTTCATGGA CAAATCGAAA AGTAATAGAT TTTTATTTAA AATTCTGTAA AACAATGTAT
GAAGCGTTAA AAGGAAAAGT TCACTACTGG TCTACATTTA ATGAAATGAA TCACATTGAT
CCCCAGACAG AAGCATCAGA TATTTTCACA TATATTATTG CGGGTTTAAA GTATTCTGAA
ATGGAGGATA AAAAGCAAAC ACTTGCGACA ATAGGATATA ACATGACTTT GGCAGGTGTA
AAAGCTGTAA AACTGGGTCA TGATATAGAT GCCGGCAATA AAATAGGATG TGTATTTGGT
CTGACACCGG TATATCCGTT TAATTGTAAT CCTGTAAATG TTATGAATGC TTTCAAAGAA
AACGAGCATG AATTTTATCA GATTGATGCA ATGTGCAACG GAAAATTTCC CGGTTATAAA
CTATGTGAAT ATCAGGAGCA GGGAATAAGT CTTGATATTA CAGATGAGGA TGAACAGGCC
TTTGCAGAAG GAAAAATAGA TTTCATTGGA TTGAATTATT ATTCTTCAAG TGTATCTCGT
TATGAAGGCA GTGAAAATGA TGAAGAGACT TTATTCGGCG GTATACAAAA TCCTTATCTG
GAAAAAAGCA GATGGGGCTG GTCCATTGAT CCTGTCGGTT TGAGATATAT ACTAAATTAT
GTTTATCGCA GATACGGTCT TCCTGTCATT ATTACCGAAA ATGGTCTGGG AGCTGTAGAT
GAACCGGATA AAAACGGAAA TATCGAGGAT AATTACCGTA TTGAATATCT GCAAAAGCAT
ATTGAGCAAA TAAAAAAAGC TGTTATTGAA GATCATGTCG AATGCTTCGG CTATCTGACA
TGGGCACCGA TTGATCTGGT AAGTGCAACA ACCGGTGAAA TGAAGAAGCG GTATGGTTTT
ATTTATGTAG ATAAACATGA TAACGGCAAA GGCACTTTGG AACGGAAAAA GAAAAAATCT
TTTTACTGGT ATAAAAATGT CATAAAATCA AATGGCAGTG AATTATAA
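
The GC content field in Gene Information (35%) can in principle be recomputed from the gene sequence. The sketch below shows one way to do that, assuming the sequence is pasted as a string and that the spaces between 10-base blocks are only formatting. The helper name `gc_percent` is hypothetical. It is demonstrated on the first line of the sequence only, so its result is not expected to match the gene-wide figure.

```python
def gc_percent(seq: str) -> float:
    """Percentage of G and C bases, ignoring whitespace between blocks."""
    cleaned = "".join(seq.split()).upper()
    if not cleaned:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in cleaned if base in "GC")
    return 100.0 * gc_count / len(cleaned)


# First 60 bases of the gene sequence, copied from this record.
first_line = "ATGGTATTGA AGAAGTCTTT TTTATGGGGT GGAAGTATAG CGGCCCATCA ATGCGAGGGA"
print(gc_percent(first_line))
```

Passing the full 1428-base sequence instead of `first_line` would give the value to compare against the reported 35%.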
 
Protein sequence
MVLKKSFLWG GSIAAHQCEG AWDKDGKGVG IMDLVTQGTY EKPREITKTI EEGKIYPSHE 
GIDFYHNYKE DIALFAEMGF KALRISVDWS RIYPKGDEEK ANALGIQFYQ NIVDELLKHK
IEPVVTLYHF EMPVHLVTEY GSWTNRKVID FYLKFCKTMY EALKGKVHYW STFNEMNHID
PQTEASDIFT YIIAGLKYSE MEDKKQTLAT IGYNMTLAGV KAVKLGHDID AGNKIGCVFG
LTPVYPFNCN PVNVMNAFKE NEHEFYQIDA MCNGKFPGYK LCEYQEQGIS LDITDEDEQA
FAEGKIDFIG LNYYSSSVSR YEGSENDEET LFGGIQNPYL EKSRWGWSID PVGLRYILNY
VYRRYGLPVI ITENGLGAVD EPDKNGNIED NYRIEYLQKH IEQIKKAVIE DHVECFGYLT
WAPIDLVSAT TGEMKKRYGF IYVDKHDNGK GTLERKKKKS FYWYKNVIKS NGSEL
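
The protein sequence can be cross-checked against the gene sequence by translating codon by codon. The following is a minimal sketch, not the database's own pipeline. It builds the codon table from the compact amino-acid string used for NCBI translation table 11, whose codon-to-residue assignments are the same as the standard code. Table 11 also permits alternative start codons; that rule is not implemented here, which does not matter for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon order T, C, A, G at each position, paired with the residue string
# for translation table 11 ('*' marks a stop codon).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a DNA string in frame 1, stopping at the first stop codon."""
    cleaned = "".join(seq.split()).upper()
    residues = []
    for i in range(0, len(cleaned) - 2, 3):
        aa = CODON_TABLE.get(cleaned[i:i + 3], "X")  # 'X' for unknown codons
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 and last 18 bases of the gene sequence in this record.
print(translate("ATGGTATTGA AGAAGTCTTT TTTATGGGGT"))  # MVLKKSFLWG
print(translate("AATGGCAGTG AATTATAA"))                # NGSEL
```

The two fragments reproduce the first ten and last five residues of the protein sequence above; the terminal TAA is read as the stop codon.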