Gene Mthe_1042 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMthe_1042 
Symbol 
ID4463110 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosaeta thermophila PT 
KingdomArchaea 
Replicon accessionNC_008553 
Strand
Start bp1127023 
End bp1128087 
Gene Length1065 bp 
Protein Length354 aa 
Translation table11 
GC content52% 
IMG OID639700060 
Productperiplasmic binding protein 
Protein accessionYP_843466 
Protein GI116754348 
COG category[P] Inorganic ion transport and metabolism 
COG ID[COG0614] ABC-type Fe3+-hydroxamate transport system, periplasmic component 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGTTCATCA GAGTGAACCA ATCGATCCAC ATTTTCCTGG CTGCCATGAT TTTGATGGCA 
GTTGGAACTA GTGTGCATGC ATCGGAACCG GACGATCTCA TAACCATAGT AGACTCTGCT
GGAAGAGAGG TGGTGGTTCC GTATCCGGTG GAGTCTGTGG TTGTTCTCTG GAGCAATGCG
GCCAAAGAGA TGAGAGCCCT GGGGGCGGTG GACAGAATTG TGGGCATGGA TCAGTCCACG
AAGGATGAGG TAGATAAGGG GACACTCCCA GAGCTGACGA ACGTACCTGT GGTGGGAACT
CAGGAGGAGC CAAACTACGA GAAGATCGCC GAGCTGAAAC CTGATGTTGT CATATGCCTC
TCAGCTGGGT ATCCACCAGA GCCAGATGAG GTGCAGGAGA AGCTGGACCC ATTTGGGATA
AAAGTCGTCG GACTGGACTT CTACAGGACC GAGGTCTGGT TCGATGAGAT AAGAACACTG
GGGAAGATGC TCGGAAAAGA GGCCGAGGCT GAAGAGTATA TGTCGTTCTT CAGGAGCTAT
TACGACCGTA TCAACCAGAC ACTCGCCACG ATACCAGACC CAGATCGGAA GACCGTCTAT
TTTGAGGGCG CCAAGAAATA CCTCACATAC GGTGGAGCAG GTTATGGCAG TGGCATACCT
AATATGATCC GCGCTGCCGG TGGTAAGGAT CTTTATCCTG AGAGGTCTGA GCTGGCTTTT
GAGGTCGATC CTGAGGATGT CGCCAGAAGG AATCCCGATG TGATATTCAA AGGCACCACC
TTGGGATGGG ATGCAGAGAG CGAGGAGGAG TTCAAGGCCA TCCGGGATGA GATAATGAGC
CGTCCTGAGC TGGCAAACAC AAATGCGGTT AAGAACGGCC AGGTCTACGT AATAAGTTTC
GACGTAGCAG GAGGGGCTGG CAAGAAGTTC GGGCCTGTCT TCCTGGCCAA GGTGCTCTAT
CCGGAGAAGT TCCAGGATAT GGATCCGATG GAGTTCTACA GGGAGTATCT GAGGAGATTC
CAGGGGTTGG AGTACAGAGG TGTATACCTC TATCCAAACC CATGA
 
Protein sequence
MFIRVNQSIH IFLAAMILMA VGTSVHASEP DDLITIVDSA GREVVVPYPV ESVVVLWSNA 
AKEMRALGAV DRIVGMDQST KDEVDKGTLP ELTNVPVVGT QEEPNYEKIA ELKPDVVICL
SAGYPPEPDE VQEKLDPFGI KVVGLDFYRT EVWFDEIRTL GKMLGKEAEA EEYMSFFRSY
YDRINQTLAT IPDPDRKTVY FEGAKKYLTY GGAGYGSGIP NMIRAAGGKD LYPERSELAF
EVDPEDVARR NPDVIFKGTT LGWDAESEEE FKAIRDEIMS RPELANTNAV KNGQVYVISF
DVAGGAGKKF GPVFLAKVLY PEKFQDMDPM EFYREYLRRF QGLEYRGVYL YPNP