Gene Mpal_1299 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMpal_1299 
Symbol 
ID7271159 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMethanosphaerula palustris E1-9c 
KingdomArchaea 
Replicon accessionNC_011832 
Strand
Start bp1330702 
End bp1333518 
Gene Length2817 bp 
Protein Length938 aa 
Translation table11 
GC content60% 
IMG OID643569933 
ProductDNA topoisomerase type IA central domain protein 
Protein accessionYP_002466356 
Protein GI219851924 
COG category[L] Replication, recombination and repair 
COG ID[COG0550] Topoisomerase IA 
TIGRFAM ID[TIGR01057] DNA topoisomerase I, archaeal 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.596273 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
TTGCATCTGA TAATCACTGA AAAGAACATT GCGGCTGACA GAATAGCCCA TATCCTCGCA 
GGAAAGACCT ACGTCGAGGT GAAGAAGGAC GGGGGGGTCA GTACCTACTC GTTCAACGAC
ACGGTCGTGG TCGGTCTTCG GGGGCACGTG GTGGAGGTGG ACTTCGAGCC AGGATACACC
AACTGGCGGA GCGAGGTGAA CACTCCGAGA TCCCTGATCT CGGCCAGGAC CATCAAAGCG
CCGACCGACA AGAAGATCGT CACGCTGATC CAGAAACTGA CCAAAAAAGC CGACCATGTC
ACGATCGCGA CTGATTTTGA TACCGAGGGG GAACTGATCG GGAAGGAGAC CTATGACCTC
GTCAAGGCTG TCAAGCCGGA TGTCAGGGTC GACCGGTCCC GGTTCAGTGC GATCACCGAG
GAGGAGATCC AGACTGCGTT CGCTGCACCG GCCGAGCTTG ATTTTGCTCT GGCTGCAGCC
GGCGAGGCAC GGCAACTGAT CGATCTGATC TGGGGAGCCT CGCTCACCCG GTTCATCAGC
CTGGCCGCCC ACCGCGGCGG CAAGAACATC CTCTCTGTCG GCAGGGTTCA GAGCCCGACC
CTTGCGATGA TGGTCGACCG TGAGAAGGAG ATCGAGGCGT TCGTGCCTGA ACCCTACTGG
GTGCTCACCG TCGATTCAAA CAAGGACGGG GAGCCGGTGC TCGCCCGTCA TACCACAGCA
CGGTTCACTG ATGTGGCGAT AGCAGAGGAG GCGAAAGAAG CCACCCGCGC CCCACTGATG
GTGACCGAGG TGAAGGAGGG ATCCAAGGTC GACCGTGCTC CGACTCCGTT CGATACCACA
GGTTTCATCG TGGCCGCCGG CCGGCTCGGT TTCTCTGCCG CGAACGCGAT GCGGATCGCA
GAGGATCTGT ACATGCACGG GTACATTTCG TATCCCCGTA CAGACAACAC CATCTATCCC
AGGTCGCTCG ATCTGAATGG CGTCCTCAAG ACGCTGCAGA AGACCGAACT CTCGTCTGCT
GTCATATGGG TTATGGCCAA TCGGAGACCA GTGCCGACCC AGGGTAAGAA GTCCACCACC
GACCACCCGC CAATCCACCC AAGTGGAGCG GCGACCAGGG CCGAACTCGG CGACGAGCGG
TGGAAGATCT ATGAACTGGT AGTCCGCCGG TTCCTCGCGA CTCTCTCCCC TGATGCCCGC
TGGATGACGA TGAAGGTCCT CTTTACGGCT GGAAATGAGC CGTACACCAC CACCGGAGCG
ACTCTGCTCG AGGCTGGATG GCGGACTGTC TACCCCTACA GCGATGCGAC CGAACACCCG
CTCCCGCGCT TTGCGGTCGG CGATAGCCTG CCGATCGAGC AGGTGAACCT CGACCGGAAG
GAGACCCAGC CGCCGCCCCG GTACACCCAG AGCAGGCTGA TCCAGCAGAT GGAAGAACTC
GGGCTCGGAA CCAAGAGTAC CCGGCACGAG GTGATCGGGA AACTGGTCGG CCGGAAATAT
ATCGAAGGGA ATCCGCTCAG GCCGACGCTG GTCGGACGGG CCGTGATCGA GTCGCTCGAA
GACCATGCGG CTGCCATCAC CCGACCGGAC ATGACCCAGA CGATCGATGC ACATATGCAG
CAGATCAAAG AGCGGAAGCG GACACGCGAT GATGTGGTGA CCGAATCGCG GGCCATGCTG
AACCGTGCGT TCGACGAACT GGAGGAGCAC CAGAGTCAGA TCGGTGAGGA TATCATGGGG
AGGACCGTCG AGGAGATGAT CCTCGGCCCC TGTCCTGTCT GTGGGAGCGA TCTCCGGATC
CATCATATCC GGAACAGCAG CCAGTTCATC GGATGCACCC GGTATCCGGA CTGCCGGTTC
AACATCGGAC TGCCCCTGAC CCAGTGGGGA TGGGCGATCA GGACCGATAC GGTCTGCCCA
ACCCATCACC TGAACCATGT CAGGTTGATC GCCAAAGGGT CCAGGCCCTG GGATATCGGC
TGCCCGCTCT GTCACCATAT CGAATCCAAT CAGGAGACGA TGGTGCTGAT CCCCTCGATG
ACCGAGGAGA TACTCGGACG GTTGCAGCAG CATCACATCT ATACCGTTCA TGAAGTCGCC
GATGCACCCC CTGAGGCGCT CGCCTCCGCG GCAGAGATTT CATCAACAGC GGCGGAACAC
CTGAAGTCTG AAGGAGAAGC GGTCCTTGGA CTCCTCCGGC TCCGCTCAGA ACTGCGAAAA
TTTGTCAGAA AGCAGGTTCC ACCCCGGCGC GGCAGGAGTC ATGCCAAGAT CATGAAGCAT
CTCCATGCGA ACGGTATCAA TACGATCGCC GACCTCGCAA AGGCGGACCC GACCCTGCTT
CGGACAGCAG GGGTCGGGGA GAAGGAGGTG ACATCCCTCC TTATGCAGGC GAAGGAGTAC
TGCAACGACA AGACGTTGCG TGCCATCGGA GTGCCGGCGA TCAGCCTCAA AAAGTACTAT
GCCGCAGGAA TCCAGAGTCC CGAGGATTTC TGCAGGTATC ATCCGGTCTA TCTGAGTGTC
AAGACCGGAA TCAGTCCGGA CACCACGTTC CGGCATGCAG AGATGGTCTG CATCGCCCAG
AACAGACCGG TGCCCCGCAA AGTGACCAGG GCAATGCTCG AACGGGGGCG TGCCGAACTA
TTGACGATTC CCGGGCTTGG GGAGACGACG ATCGAGAAGC TGTACAGCGG CGGCGTGATC
GACGGCATGA CCCTTGCCTC TGCAGATCCT GCGGCGCTGG CCAGCCATTC CGGGATCCCG
CTCAAGAAGG TACAGGAATT TCAATCGCGG CTCCCTGGTT CCTCTCAGGC CAGTTGA
 
Protein sequence
MHLIITEKNI AADRIAHILA GKTYVEVKKD GGVSTYSFND TVVVGLRGHV VEVDFEPGYT 
NWRSEVNTPR SLISARTIKA PTDKKIVTLI QKLTKKADHV TIATDFDTEG ELIGKETYDL
VKAVKPDVRV DRSRFSAITE EEIQTAFAAP AELDFALAAA GEARQLIDLI WGASLTRFIS
LAAHRGGKNI LSVGRVQSPT LAMMVDREKE IEAFVPEPYW VLTVDSNKDG EPVLARHTTA
RFTDVAIAEE AKEATRAPLM VTEVKEGSKV DRAPTPFDTT GFIVAAGRLG FSAANAMRIA
EDLYMHGYIS YPRTDNTIYP RSLDLNGVLK TLQKTELSSA VIWVMANRRP VPTQGKKSTT
DHPPIHPSGA ATRAELGDER WKIYELVVRR FLATLSPDAR WMTMKVLFTA GNEPYTTTGA
TLLEAGWRTV YPYSDATEHP LPRFAVGDSL PIEQVNLDRK ETQPPPRYTQ SRLIQQMEEL
GLGTKSTRHE VIGKLVGRKY IEGNPLRPTL VGRAVIESLE DHAAAITRPD MTQTIDAHMQ
QIKERKRTRD DVVTESRAML NRAFDELEEH QSQIGEDIMG RTVEEMILGP CPVCGSDLRI
HHIRNSSQFI GCTRYPDCRF NIGLPLTQWG WAIRTDTVCP THHLNHVRLI AKGSRPWDIG
CPLCHHIESN QETMVLIPSM TEEILGRLQQ HHIYTVHEVA DAPPEALASA AEISSTAAEH
LKSEGEAVLG LLRLRSELRK FVRKQVPPRR GRSHAKIMKH LHANGINTIA DLAKADPTLL
RTAGVGEKEV TSLLMQAKEY CNDKTLRAIG VPAISLKKYY AAGIQSPEDF CRYHPVYLSV
KTGISPDTTF RHAEMVCIAQ NRPVPRKVTR AMLERGRAEL LTIPGLGETT IEKLYSGGVI
DGMTLASADP AALASHSGIP LKKVQEFQSR LPGSSQAS