Gene Sterm_1943 details


Gene Information

Locus tag: Sterm_1943
Symbol:
ID: 8597410
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Sebaldella termitidis ATCC 33386
Kingdom: Bacteria
Replicon accession: NC_013517
Strand:
Start bp: 2072231
End bp: 2074321
Gene Length: 2091 bp
Protein Length: 696 aa
Translation table: 11
GC content: 36%
IMG OID:
Product: 7TM receptor with intracellular metal dependent phosphohydrolase
Protein accession: YP_003308730
Protein GI: 269120553
COG category:
COG ID:
TIGRFAM ID:
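
The coordinate and length fields above can be cross-checked against each other. The sketch below is illustrative only: the function names are hypothetical, and it assumes that the start and end positions are 1-based and inclusive and that the last codon of the CDS is a stop codon that is not counted in the protein length. The record's values are consistent with both assumptions.

```python
def gene_length_from_coords(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_from_gene_length(gene_length_bp: int) -> int:
    """Number of residues, assuming the final codon is an uncounted stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Values taken from the record above.
gene_length = gene_length_from_coords(2072231, 2074321)
protein_length = protein_length_from_gene_length(gene_length)
print(gene_length, protein_length)  # 2091 696
```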


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGAGACTAA ATTTTTTTGG AAGAGAAGTA ACTATAACAA TAGATGATAA AAAGATAAAT 
AAAAAAGGTG TAGTTTCCAG TTTTGTACTG AATAATAAAT TCAGATACAG ATTTCTTCTG
CTAAGCAGCA TGTTTCTCAT ATTTGGTATA TTTATGGAAA CAAAAAGGAT AAATATCAGC
TATCATATCG GGGATAAAGC AACAAAGGAC GTAATAGCTT ATAAGGATGT GGTATATTAT
AAAGATCTGC TGGATGAAAG CGTAAAAAAC AGAATTATAG AAAATACGAC TCCTGAATAT
GACAGAAATA AAGAAGTAGA AGACAACTCA GTTTCACAGT TGAATACATT TTTTGATTCC
ATAAACTGGT TTAAGCTTCA GCCCGAAATA GACAATGCCG AGCTAAAAAG CTTTATAGAT
CAGAATAAGC TGAATCTGAG CGTGGATGAA TTAAGAACTA TAATATTGAG AGACAGCAGC
TCGTATATTC TTGCACTGGT AAATGATATG AGAAAAATTT ATGCCGAGGG GATAGTAAAA
AAAGGCGATT TTGATAAAAT AGTTGCTTCA AAGGACTATA AGCTCGGGGC TGAAGAGAAA
AAACTGTTAA AGAATTTCAT GGTAATAAAT ATGAAGTTCA ATCAGGAAAA AACAAAGGAA
AAAATAGATA AAAATATAGA ATCATTAAAA AATCAGGAAA TGAAAATATA TAAAGGCGAC
GTGATACTGA AAAAAGGTGA TGTAATAACA GCAGATGCTT ATGAAAAGCT GGAAAAGCTG
AATATGGTAA AAGTAAGCGA TAAGGCCAGA AAAAGTACCG GATTATTACT TTCATTCGTT
ATTTTATCTA TGGTTTTGTA TTATATTTTG AAAAAATATT CGAAAAAAAT AATGGATTCA
AAGGCATTTT ATCCGAGTCT TATTACAGTA GCAATATTAA ATCTGATATA TCTGGCATTT
TTTCAGGCGG GATTCCTTTT GTACCTTCTG CCTTTTGCAA TTATACCTAT AATTCTCTCT
ATTTTGGGAG ACAGAGTATT CGCCATTACA CTGTCGGTAT TTAATCTGGT TCTTCTTACA
AGAGATGAAA CATGGTTTCT GATAACTCTT GGAGTAACAG TGGTAGCAAT ATATCAGGCT
TCGGCTTTGG TAAACAGAAG CGAGTTCGTA AAGCTGGGAG TATTTCTCGG AGTATTTCAG
GCGCTGCTTT CTGTGGCATA CGGACTTGTA AATCAGTTTC CTATGACTAT ACTGGGGCTT
TTGATTATAC TGTCGGTTTT TTCCGGAATA CTCACAGGAA TGATATGTCT CGCCCTGCTG
CCGTTTTTTG AGAACACCTT TGATATACTG ACCAATATAA AGCTCCTTGA GCTGAGCGAT
TTTTCGCATC CTTTGCTGAG AAGTCTGCTG GTAAAAGCAT CAGGAACATT TCACCATAGT
ATTATGGTGG GAGCACTTGC GGAAAGAGCT GCGGAGTCAG TGGGGGCAAA TGCTACCTTT
GCAAGGGTAG CGTCATATTA TCATGATATA GGGAAAATGA AAAGACCGAA TTTCTTTGTG
GAAAACCAAA AAGGAAGGGA AAATCCGCAT AATCATATAA AACCTACGCT GAGCGCGCTG
ATTATTATTT CGCACACCAA GGACGGCGTG GCTATGGGTA AAAAATATAA TCTTCCAAAG
GAAATTCTGG ATATAATGGT GGAACACCAC GGTACGACAC TTGTGCAGTA TTTTTATAAC
AAGGCTAAGG AAGAGGGCGA AGAGATAAGA GAACAGGATT TCAGATACAG CGGACCGAAG
CCGAGAACAA AAGAATCTGC GATAATACTA ATGGCTGATA CAATAGAAGC AGCAGTAAGA
GCTGCGGAAG ATAAAACAAA GGAAAATGTA GAAAGTCTGG TAAGATATCT GATAAAATAT
AAGATAGAAG ACGGACAGCT TTCAGCTGCG GATATAACTC TGAGAGAGAT AGAAGTAATA
ATAAAGGCCT TTCTGGATGT GCTTCAGGGC GCTTATCACC AGAGAATACA ATATCCTAAG
GTAGGAGAAA ATAAAAAATT AGTGGAAGAC GAGGATTTTA AGCATGAGTA A
 
Protein sequence
MRLNFFGREV TITIDDKKIN KKGVVSSFVL NNKFRYRFLL LSSMFLIFGI FMETKRINIS 
YHIGDKATKD VIAYKDVVYY KDLLDESVKN RIIENTTPEY DRNKEVEDNS VSQLNTFFDS
INWFKLQPEI DNAELKSFID QNKLNLSVDE LRTIILRDSS SYILALVNDM RKIYAEGIVK
KGDFDKIVAS KDYKLGAEEK KLLKNFMVIN MKFNQEKTKE KIDKNIESLK NQEMKIYKGD
VILKKGDVIT ADAYEKLEKL NMVKVSDKAR KSTGLLLSFV ILSMVLYYIL KKYSKKIMDS
KAFYPSLITV AILNLIYLAF FQAGFLLYLL PFAIIPIILS ILGDRVFAIT LSVFNLVLLT
RDETWFLITL GVTVVAIYQA SALVNRSEFV KLGVFLGVFQ ALLSVAYGLV NQFPMTILGL
LIILSVFSGI LTGMICLALL PFFENTFDIL TNIKLLELSD FSHPLLRSLL VKASGTFHHS
IMVGALAERA AESVGANATF ARVASYYHDI GKMKRPNFFV ENQKGRENPH NHIKPTLSAL
IIISHTKDGV AMGKKYNLPK EILDIMVEHH GTTLVQYFYN KAKEEGEEIR EQDFRYSGPK
PRTKESAIIL MADTIEAAVR AAEDKTKENV ESLVRYLIKY KIEDGQLSAA DITLREIEVI
IKAFLDVLQG AYHQRIQYPK VGENKKLVED EDFKHE
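
The relationship between the gene sequence and the protein sequence can be illustrated with a small translation sketch. This is not the tool that produced the record: the `translate` helper and the compact codon table are written here for illustration. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in which start codons are permitted), so the standard assignments are used. Only the first 30 bases and the final 51 bases of the gene sequence are translated, to keep the example short.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order; '*' marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base groups
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases and last 51 bases of the gene sequence in this record.
first_block = "ATGAGACTAA ATTTTTTTGG AAGAGAAGTA"
last_block = "GTAGGAGAAA ATAAAAAATT AGTGGAAGAC GAGGATTTTA AGCATGAGTA A"

print(translate(first_block))  # MRLNFFGREV
print(translate(last_block))   # VGENKKLVEDEDFKHE
```

Both outputs match the corresponding ends of the protein sequence listed above, and the final `TAA` codon is the stop codon that accounts for the difference between 2091 / 3 = 697 codons and 696 residues.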