Gene Mthe_1443 details

Gene Information

Locus tag: Mthe_1443
Symbol:
ID: 4461897
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Methanosaeta thermophila PT
Kingdom: Archaea
Replicon accession: NC_008553
Strand:
Start bp: 1542346
End bp: 1543563
Gene Length: 1218 bp
Protein Length: 405 aa
Translation table: 11
GC content: 55%
IMG OID: 639700462
Product: replication factor A
Protein accession: YP_843857
Protein GI: 116754739
COG category: [L] Replication, recombination and repair
COG ID: [COG1599] Single-stranded DNA-binding replication protein A (RPA), large (70 kD) subunit and related ssDNA-binding proteins
TIGRFAM ID:
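
The coordinates and lengths listed above are mutually consistent, which a short check makes explicit. This is a minimal sketch in Python, assuming the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon that is not counted in the protein length; neither assumption is stated in the record, and the variable names are illustrative only.

```python
# Values copied from the record above.
start_bp = 1542346
end_bp = 1543563

# Assumption: coordinates are 1-based and inclusive.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which is not counted in the protein length.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1218 405
```

Both values match the Gene Length and Protein Length fields.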


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGATA TTGAGCAGAT CATATCCAGG CTCAGCGAGC TTGGGGTCGA TGTCAACATC 
GAGGATGTGG AGAATAGGTA CCGTCTTCTT GTCGACAAGT TTCAGGTCCC TCCTCGGGAG
GCGCAGAGAA GCGTTCTGAA TTTTTATTTG AAAGAGAAAG GCATCGCGCT CCCCTCCAGA
AAGAGCGAGC AGGTGAAGAT AAACCAGATC CGCGAGCCTG GAAGGTGGGT GGACCTTGAG
GCGAAGGTTC TGAGCCTCTT CGAATCACCC AGCCCGGCTA TCTCCCAGGC AGGGATCCTG
GGAGATGACA CAGGAAACAT ACGGTTCGTC AAGTGGGCGA AGTCCGGGCA GCCAGACCTT
GTCGAGGGAA AGAGCTATCT CCTAAAGAAT CTCGTAACTG ACGAGTTCCA GGGAAGGTTC
AGCGTCAAGA TAAACAGAAG CACCGAGATC GCGGAGCTCG ACAGAGAGAT CGAGTCTGTT
GTGCTTCCAC AGAGCTCCGC AGATTACAGG GTTGTGGACA TCAGCGGGCC AGGGCAGTGG
ATCAACCTCC GGGCCAAGGT CGTCCAGCTC TGGGAGCCCT CAAGCGAGTC GATCCAGCAG
CAGGGCCTGC TCGGAGATGA GACCGGAGTC GTTAGGTTCG TCAAGTGGGC GAAGTCCGGG
CAGCCAGACC TTGTCGAGGG AAAGAGCTAC CTCCTAAAGA ATCTCGTAAC TGATGAGTTC
CAGGGAAGGT TCAGCGTCAA GATAAACAGA AGCACGGTCA TAGAGGAGAC CGACGAGCCC
ATAGAGGTAT CTCTAAACAG AAGGATCACA GGTGCCATAG TGGATATACA GAAAGGCTCT
GGACTGATCA AGAGGTGCCC GACATGCAGG AGGCCGCTCT CCAAAGGTAT GTGCACAGAT
CACGGGAAGG TTGAGGGCGT ATACGATCTC AGAGTGAAGG CTGTAATAGA CGATGGCCTC
GTCGCTCAAG ACATACTGAT AAACCGCGAG CGCGTTGAGG AACTCATCGG CCTCACCATG
GAGCAGGCGA AGGAGATGGC CATAGAGGCG CTGGATCACG AGGTCGTCCT CGCGCTCATC
GAGGAGAAGC TGATAGGAAG GTACTTCGAG GTCACCGGGC CAGTTAGGGA CAGGTATCTG
CTCGTGGACA GCATAAATGA GATGACCTTC AGCGATGATG ATGTCTCCCT GCTCGTGAGC
AGGGCGGAGG GACTATGA
 
Protein sequence
MEDIEQIISR LSELGVDVNI EDVENRYRLL VDKFQVPPRE AQRSVLNFYL KEKGIALPSR 
KSEQVKINQI REPGRWVDLE AKVLSLFESP SPAISQAGIL GDDTGNIRFV KWAKSGQPDL
VEGKSYLLKN LVTDEFQGRF SVKINRSTEI AELDREIESV VLPQSSADYR VVDISGPGQW
INLRAKVVQL WEPSSESIQQ QGLLGDETGV VRFVKWAKSG QPDLVEGKSY LLKNLVTDEF
QGRFSVKINR STVIEETDEP IEVSLNRRIT GAIVDIQKGS GLIKRCPTCR RPLSKGMCTD
HGKVEGVYDL RVKAVIDDGL VAQDILINRE RVEELIGLTM EQAKEMAIEA LDHEVVLALI
EEKLIGRYFE VTGPVRDRYL LVDSINEMTF SDDDVSLLVS RAEGL
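
The sequences above are printed in blocks of 10 residues, 60 per line, so the spaces and line breaks have to be removed before the sequence is used elsewhere. The following is a minimal sketch in Python; `join_blocks` is a hypothetical helper name, and only the first and last lines of the gene sequence are used for illustration.

```python
def join_blocks(lines):
    """Concatenate blocked sequence lines, dropping all whitespace."""
    return "".join("".join(line.split()) for line in lines)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGGAAGATA TTGAGCAGAT CATATCCAGG CTCAGCGAGC TTGGGGTCGA TGTCAACATC "
last_line = "AGGGCGGAGG GACTATGA"

head = join_blocks([first_line])
tail = join_blocks([last_line])

# A full line holds 60 bases; the CDS starts with ATG and ends with TGA.
print(len(head), head[:3])   # 60 ATG
print(len(tail), tail[-3:])  # 18 TGA
```

Passing all 21 gene sequence lines to the same helper should give a single string whose length can be compared with the Gene Length field.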