Gene Mthe_1335 details


Gene Information

Locus tag: Mthe_1335
Symbol:
ID: 4462141
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 1439074
End bp: 1440984
Gene Length: 1911 bp
Protein Length: 636 aa
Translation table: 11
GC content: 54%
IMG OID: 639700352
Product: serine phosphatase
Protein accession: YP_843751
Protein GI: 116754633
COG category: [K] Transcription; [T] Signal transduction mechanisms
COG ID: [COG2208] Serine phosphatase RsbU, regulator of sigma subunit
TIGRFAM ID:
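
The start and end coordinates, gene length, and protein length listed above are mutually consistent. A minimal Python sketch of that arithmetic follows. It assumes that the coordinates are 1-based and inclusive and that the gene length includes one untranslated stop codon. Neither convention is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1439074
end_bp = 1440984

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS ends with a single stop codon that is not translated.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # prints: 1911 636
```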


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATGAGAC CGCCCATATC TTCGACTCTG ATCGTGATCA TGGTGATTGC AGCAGTGGTA 
CCTGCCGCAA TTATGGGGCT GCTCTTCAAT ACAGAGGTCA GCAGAATGGT GGGGCCGATA
CAGGAGCAGC TTGGCGATAT AAACAGCACC GCAGTAAACT ACTCCTCGGG CGCAACAGAC
CAGGAGCTTA TCGTCTCTTC CAAGGCCATG CAGTACGAGG AGTTTTTCAG GAGGATCGCG
GAGAGCAATC AGTTCGTGGC AGACTATGCT GCTTCCGGTT TCTCTGATAT CGATCGAGTG
GCAGATCCGA ACAGCCCCCT CTCACAGACC CTAGCTAGGG CGATAAAGAG GAACAGCGCG
ATCGAGCGAA TATACCTGGC GACCGCTGAT GGAAGAATAG CTTCATGGCC GGAGACAGAT
GGAATCAGGA ATTATGCCAT AAACGCATCC GAACTCAGAT CACTGGGATG GTACAATGCG
GCACAGGCTG CCGGCGGAAC TGTATGGATT CCCGGAGACG AGATGCACAT GATGTGCGCA
ACGCCTGCAT ACTGGAATAT CACACTCTAC TGCGTTGCTG CATCAGAGGT ATCTCTGTCA
GATCTCTACT CAGATCTATC GATGCTCAGA GGCAGCGGCT ATCCATTCAT AGTGAACAGA
AGTGGCGATG TGGTGATGAT TCCCAAGGTC CGGAGGGGTG ATGCCCCATG GGACAATCTG
CTCCTCTCCG GAAACCTCTA CAAATCCAAC ATCTCTGCAT TAGCGGAGCT CGGCGATCGT
ATATCAAAGG GTAAGAGCGG CTCGGATTAT CTCATGATAG ATGGCCGGGG CTGGTTTGTG
GTTTATTCGC CCGTGAAGAG TGTTGGGTGG ACAGTCGTTG TTGCGTACCC ATCCGAGCGG
ATGATGGTTC CCCTAAGCAT CGTGGAGAGA AGCGCGAACG CTCTCAGTCA GAGAGCCGTG
GAACTCCTGC GCAGCAGCAC AGCGGCAATG TTCTCAAAAG GACTGTTACT GATAATCATC
TCTGGTGTGG CATTCGGATT GATAGGAATA ATGATCCGCA GGCAGCTCAG GAGATCTGCA
GGCTGCATAT CTGATGCACT GCAGCGCATC GGCGGGGGTG AGCTGGAGAG GCGCGTGCCT
GTGGAGTGCG ATATCGAGGG GATTGTGCAA TCCATTGAGT CTATGCGCCA GTCTCTCAGA
ACGCTCCTCG AGGGGGCCAA AGCGGAGAGT TATGCAAGGG GATCACAGGA GTGCAAGAGC
TCTGTATTAA AATCATTTGA CACGTATCTG ACTGCCGGTA CTCTTCCTCT CATCGAGGGA
TATGATCTCA GCATTCGCCA GATATCGAGA GGAAGCACGT TCCACGACGT TCTGGAGATC
CAGCACGGAA AGGTCGCTCT GTGCATGGGT AGAGCAAATG GGGAGGAGAT GGAGTCTGCC
GTCCTCGCTG CCATCGCCAG AGCGGTTATA AGAGCGCTTC CATCTCAGCA CCCGGATGAG
GTGATAAAGC GCGCGAACAG CATACTGGCA AAGAGCTCAT CATCCCCCAT CTCCTGCTTC
TACGCGGTCC TCGATCACGG GCAGGGGGAG CTGGTGTACT CAAATGCAGG CCACGCTCCG
CCGTTTGTGG TGAGCCGGGA TGGATCCGTG GATACCCTCT GCGGCGATGG GATACCCATG
ACGATCAGGG ACGATCTCAA ACTCGGGTAT GAGCGCCGAC CCATTTCGAA GGGAGATGTC
CTTGTGATCT ACTCTGAGGG CATGATAGAG GCGCAGGGCT TCGACCTGGA GCGTCTGATA
GGTGTGGCTC GCGGCTCCAG AACAAAGAGT GCATCAGAGA TAGCGGACGA TATAGAAAGG
GCGGTCCCGA AGGGGGATGG CATGGCGGTC ATGGTGATGA AATCAGTTTG A
 
Protein sequence
MMRPPISSTL IVIMVIAAVV PAAIMGLLFN TEVSRMVGPI QEQLGDINST AVNYSSGATD 
QELIVSSKAM QYEEFFRRIA ESNQFVADYA ASGFSDIDRV ADPNSPLSQT LARAIKRNSA
IERIYLATAD GRIASWPETD GIRNYAINAS ELRSLGWYNA AQAAGGTVWI PGDEMHMMCA
TPAYWNITLY CVAASEVSLS DLYSDLSMLR GSGYPFIVNR SGDVVMIPKV RRGDAPWDNL
LLSGNLYKSN ISALAELGDR ISKGKSGSDY LMIDGRGWFV VYSPVKSVGW TVVVAYPSER
MMVPLSIVER SANALSQRAV ELLRSSTAAM FSKGLLLIII SGVAFGLIGI MIRRQLRRSA
GCISDALQRI GGGELERRVP VECDIEGIVQ SIESMRQSLR TLLEGAKAES YARGSQECKS
SVLKSFDTYL TAGTLPLIEG YDLSIRQISR GSTFHDVLEI QHGKVALCMG RANGEEMESA
VLAAIARAVI RALPSQHPDE VIKRANSILA KSSSSPISCF YAVLDHGQGE LVYSNAGHAP
PFVVSRDGSV DTLCGDGIPM TIRDDLKLGY ERRPISKGDV LVIYSEGMIE AQGFDLERLI
GVARGSRTKS ASEIADDIER AVPKGDGMAV MVMKSV
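
The sequences above are printed in space-separated blocks of 10 characters. The following is a small Python sketch of stripping that layout and computing a length and GC percentage, for anyone who wants to check the record's figures against the full sequence. The function names `clean_sequence` and `gc_percent` are hypothetical helpers, not part of any database tool, and only the first row of the gene sequence is used here as an illustration.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(dna: str) -> float:
    """Return the percentage of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    gc = sum(1 for base in dna if base in "GC")
    return 100.0 * gc / len(dna)


# First row of the gene sequence above, used only as an illustration.
first_row = "ATGATGAGAC CGCCCATATC TTCGACTCTG ATCGTGATCA TGGTGATTGC AGCAGTGGTA"
dna = clean_sequence(first_row)

print(len(dna))          # prints: 60
print(gc_percent(dna))   # GC percentage of this row only, not the whole gene
```

Pasting the complete gene sequence into `clean_sequence` in place of `first_row` gives values that can be compared with the Gene Length and GC content fields.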