Gene Mthe_0444 details


Gene Information

Locus tag: Mthe_0444
Symbol:
ID: 4462583
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 458428
End bp: 459498
Gene Length: 1071 bp
Protein Length: 356 aa
Translation table: 11
GC content: 53%
IMG OID: 639699446
Product: periplasmic binding protein
Protein accession: YP_842875
Protein GI: 116753757
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG0614] ABC-type Fe3+-hydroxamate transport system, periplasmic component
TIGRFAM ID:
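
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the function name `check_lengths` is hypothetical and not part of any database API.

```python
def check_lengths(start_bp, end_bp, gene_length_bp, protein_length_aa):
    """Return True if the record's coordinates and lengths agree.

    Assumptions (not stated in the record itself):
    - start_bp and end_bp are 1-based, inclusive positions
    - the last codon is a stop codon and is not counted as a residue
    """
    span = end_bp - start_bp + 1      # inclusive span in base pairs
    if span != gene_length_bp:
        return False
    if span % 3 != 0:                 # a CDS should be a whole number of codons
        return False
    codons = span // 3
    return codons - 1 == protein_length_aa


# Values copied from the Gene Information fields for Mthe_0444.
consistent = check_lengths(458428, 459498, 1071, 356)
print(consistent)
```

Here 459498 - 458428 + 1 gives 1071 bp, which is 357 codons, or 356 residues once the stop codon is excluded.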


Plasmid Coverage information

Num covering plasmid clones: 22
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGATCAAAT GGCTCACTTA TTCAGTATTG CTTTTACTAA TTGCTACCTC ATGCTGTGCA 
GCAGCCGAGT ACCCGATGAC GATCACAGAC TCCGCAGGGC GCGAGGTCAC CATACAGATG
CCTGTGGAGA GGATCATAGT GCTGAACTCC GACGCGGCAG AGGCTGTGAC CATTCTGGGG
GCAGCGGATA AGATCGTGGG GATATCGGAC AGCGTGAAGA ACAAGGCGTA CTACTTCCCC
GCCCTGAAGA ACAGGCAGAG CGTGGGAAAG TGGAACGAGC CTGACTATGA GATGATCGGA
GAGATAGCAA GGAGCGGTGA TGAGATTGTT CCCAACATAA TCGTGATAAG CTATACGTAT
CCCGATAAGC CCTACGGCAT AGTGGAGGTG GCAAAGAGGC TGGAGCCTTT CACGGGCATC
ACTGCAATCG GCCTGGACTT CTACAAGCCG GAGAACATGA CCCGGGAGAT AGAGCTTCTC
GGCAGGATCC TCGGGAAGGA GGCGGAAGCA CAGCGCTTCA TAGAGTGGTA TGAGGAGAAG
CAGGCGGATG TTGAGAACGC TGTGGCGAAC AGGAACGTTC CAAAGGTCTA CGTGGAGTGG
ACATCGAAGG GTGGAGAGCT CACAACGATG GGCACAGGCT CAGGCGCAGC GCAGCTTGTC
TCAATGGCGA GGGGCTACAG CGTAGCGAAT GATCTGAAAG ATGCGTATCC AAAGATCGGG
TGGGAGTGGG TCATCTCGAA GAATCCAGAT GTCATAATAA AGAGATCGAC ATCCACGCAG
CTTGGCTGGG AAAAACCGCC ATCTCTGGAT TCCACTAATC TGGAGAACAC GCTCAACGAA
GTCCTCAGCA GAAGCGGTGC AGCAGCTGTG AATGCTGTGA AGAACGACAG AGTCTACATT
GTCAACTGGG AGATCATGGC CGGATTGGAT GATGTTGTGG GCCTGACATA TCTTGCGAAG
ATCCTGCATC CTGATGTGAA TCTGGATCCG GAGAGCGTTT ACAGGGAGTA CCTCCAGTTC
CTGGGCGTGG ACTATCCTGA GGACAGGATA TTCGTGTACC CTGAAGTGTA A
 
Protein sequence
MIKWLTYSVL LLLIATSCCA AAEYPMTITD SAGREVTIQM PVERIIVLNS DAAEAVTILG 
AADKIVGISD SVKNKAYYFP ALKNRQSVGK WNEPDYEMIG EIARSGDEIV PNIIVISYTY
PDKPYGIVEV AKRLEPFTGI TAIGLDFYKP ENMTREIELL GRILGKEAEA QRFIEWYEEK
QADVENAVAN RNVPKVYVEW TSKGGELTTM GTGSGAAQLV SMARGYSVAN DLKDAYPKIG
WEWVISKNPD VIIKRSTSTQ LGWEKPPSLD STNLENTLNE VLSRSGAAAV NAVKNDRVYI
VNWEIMAGLD DVVGLTYLAK ILHPDVNLDP ESVYREYLQF LGVDYPEDRI FVYPEV
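
The relationship between the gene sequence and the protein sequence can be illustrated by translating the first row of the gene sequence and comparing it with the first 20 residues of the protein. This is a minimal sketch in plain Python: the helper names `strip_blocks` and `translate` are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard code (the two differ only in permitted start codons).

```python
from itertools import product

# Codon table built in TCAG order. Table 11 uses the same codon-to-amino-acid
# assignments as the standard genetic code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def strip_blocks(text):
    """Remove the spaces and line breaks used to group residues in tens."""
    return "".join(text.split()).upper()


def translate(dna):
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First row of the gene sequence and first 20 residues of the protein,
# copied from the record.
gene_row = "ATGATCAAAT GGCTCACTTA TTCAGTATTG CTTTTACTAA TTGCTACCTC ATGCTGTGCA"
protein_start = "MIKWLTYSVL LLLIATSCCA"

translated = translate(strip_blocks(gene_row))
print(translated == strip_blocks(protein_start))
```

Pasting the full gene sequence into `gene_row` would translate the whole coding region in the same way, with translation ending at the terminal TAA codon.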