Gene Information |
Locus tag | Nther_1423 |
Symbol | |
ID | 6314552 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Natranaerobius thermophilus JW/NM-WN-LF |
Kingdom | Bacteria |
Replicon accession | NC_010718 |
Strand | + |
Start bp | 1492411 |
End bp | 1494279 |
Gene Length | 1869 bp |
Protein Length | 622 aa |
Translation table | 11 |
GC content | 37% |
IMG OID | 642643803 |
Product | protein of unknown function DUF342 |
Protein accession | YP_001917594 |
Protein GI | 188586049 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG1315] Predicted polymerase, most proteins contain PALM domain, HD hydrolase domain and Zn-ribbon domain |
TIGRFAM ID | |
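The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information rows above.
start_bp = 1492411
end_bp = 1494279

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# A CDS is read in triplets; the last codon is assumed to be a stop codon,
# which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")
print(protein_length, "aa")
```

Under those assumptions the computed values match the reported Gene Length (1869 bp) and Protein Length (622 aa).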
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0000579847 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
Fosmid Coverage information |
Num covering fosmid clones | 44 |
Fosmid unclonability p-value | 0.370296 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | ATGGCAGAAC TACCCCAAAG GTTTTCCGGT GAAAACTTAC AAGAAGTTTT AGCGGAAGCT GCTGAATCAC TATCATGTGA AGTTGAGGAA CTAGAATATA AAGTGATACA GCGAGAAAAG AAAGGCCTTT TAAGGCGAAC CCCCTGTGTC ATTGAAGTGT CTGGACAGCA TAAAAAAAAT GACAACACAA ACACGGGTGA TAATAATGGT ATAGTGGCCG AAACGGCAGC TAGTAATGAC CCTGCGGAAG AGAAACTAAA TGTCAGTATT GATGGATATT ATGAGATATC GGAAGAAGAC AATGCTATTT ATTTAGTTGT ATACCCTCCT GAAAATCGAG GTAATTATGT CAAGTGGAAA GATGTTAAGA GTAAATTAGA AGAAAAAGGT TTTGAAATCC TCGATGAGGC ATTTATAGTA GAAATTGTCA GAAAATCGGA AGGCCAAAAA GTAGATATTT CTGAATATAT TGAGGAACAT GTAATAGATG GATCTTTTGA AATTAGAGTT GCTGAAGACA ATATGAAAGC ACTGTTAAAG GTGAATTTAC CTCAAGGAAG AGGAAAGGAA GTTAATTTAG AAGAAATTAC TCAGGCCCTA AGTGAACGAA AAATCAGTCA AAATTTAGAT TTTCAAGCAA TACATAAATG TGTAAGTGAA GGCACACAAG GCGAATTCAG AACTATTGCC ACCGGAGATC AGCCTATAGA TGGAAAAGAC GCAGAGATCC AACTACATTT TGAAGAAAAA GAAAGAAAAC CTGTAGTTAA AGAAGACGGA AGTGTGGACT ATTATAATAT TGATAATGTT ACTAATGTTA AAGCCGAGGA CCTCTTGGCG AGCAAACATC CTCCAGAGGA GGGTAGTCCC GGTAAAGATG TATATGGAAA TATAGTGTCT CCCAAACCAG GAACTGATCG GCAAATAAAA AGAGGTAAAA ATACCGAGTT AAGCGAAGAT GAAATGGAGT TAAGAGCATC TATAGATGGA CAGGTAGTCA TGAATAATGA CGGGTTTATT CACGTATATC CTGTTTATGA AGTTTCTGGT GATGTGGATG TTTCAACAGG AAATATTGAT TTTGTGGGTA ATGTTATTGT AAAAGGACAG ATAAAAAGTG GTTTAAAGGT TAAGGCTGCT GGGGATGTAG AAGTCCGTAA AAGTGTTGAT AGTTGTATAA TAGAAGCGGG AGGCAATGTC GATATTAAAG GCGGCATTCA AGGTAGGAAC AAAGGGTCTA TTACTGCAGG TGGCTCGGTA ACTTGCAAAT TCATCGAAAA TGCTCAAGTT TCTGCTGAAG GAGATATTAA TGTTATTGAA GGTATTCTCC ATAGTCAGGT AGAAGGTAAT AAAATAAATG TTTTTGAAGG AAAAAAAGGT TTACTCGTAG GTGGCAAAGT AACTGCAAGA GAAGAGGTAG TAGCTAAAAT GATTGGATCC AGTTTTGCCA CTGCCACTCA TGTAGCTGTC GGCTTAGACC CTGAATTAAG GAAAAAGTCT TCAGATATAG ATACAGAACT GAAAAACACC AACGAAAACC TGGAAAAAAC AGATAAAGCT ATTGCAATAC TACAGAAGGT CAAGCAAACT AAAGGGGCGC TGCCTAAGGA TAAAGAAAAT ATGCTTGTTA GATTGCAAAG GACTAAATCC CACTTAGACC AAACAAAACA GCAATTATGC AGCCAAAAAG AGGAAATAAA AAATATTTTA AAAGATAAAA CAGATGGCAG AGTTATAGCA AAAAAGGTGG TTTATCCTGG AGTCAAAGTA ACCATTGGTG AAGTCTCGTA TAATATAAAG 
GATGAACAAA AGAGTAGTAT GTTTAGATTG TCCTCTGATG GAGAAGTTTC CAGTGAGCCT GTATCTTAA
Protein sequence | MAELPQRFSG ENLQEVLAEA AESLSCEVEE LEYKVIQREK KGLLRRTPCV IEVSGQHKKN DNTNTGDNNG IVAETAASND PAEEKLNVSI DGYYEISEED NAIYLVVYPP ENRGNYVKWK DVKSKLEEKG FEILDEAFIV EIVRKSEGQK VDISEYIEEH VIDGSFEIRV AEDNMKALLK VNLPQGRGKE VNLEEITQAL SERKISQNLD FQAIHKCVSE GTQGEFRTIA TGDQPIDGKD AEIQLHFEEK ERKPVVKEDG SVDYYNIDNV TNVKAEDLLA SKHPPEEGSP GKDVYGNIVS PKPGTDRQIK RGKNTELSED EMELRASIDG QVVMNNDGFI HVYPVYEVSG DVDVSTGNID FVGNVIVKGQ IKSGLKVKAA GDVEVRKSVD SCIIEAGGNV DIKGGIQGRN KGSITAGGSV TCKFIENAQV SAEGDINVIE GILHSQVEGN KINVFEGKKG LLVGGKVTAR EEVVAKMIGS SFATATHVAV GLDPELRKKS SDIDTELKNT NENLEKTDKA IAILQKVKQT KGALPKDKEN MLVRLQRTKS HLDQTKQQLC SQKEEIKNIL KDKTDGRVIA KKVVYPGVKV TIGEVSYNIK DEQKSSMFRL SSDGEVSSEP VS
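The gene and protein sequences above can be cross-checked at their ends. The sketch below translates the first 30 and last 18 nucleotides of the gene sequence, using a deliberately partial codon table that holds only the codons occurring in those two fragments. The function name `translate` and the table `CODONS` are illustrative, and the assignments shown are the same in translation table 11 as in the standard genetic code.

```python
# Partial codon table: only the codons present in the two fragments below.
CODONS = {
    "ATG": "M", "GCA": "A", "GAA": "E", "CTA": "L", "CCC": "P",
    "CAA": "Q", "AGG": "R", "TTT": "F", "TCC": "S", "GGT": "G",
    "AGT": "S", "GAG": "E", "CCT": "P", "GTA": "V", "TCT": "S",
    "TAA": "*",  # stop codon
}

def translate(dna: str) -> str:
    """Translate an in-frame DNA fragment codon by codon."""
    dna = dna.replace(" ", "").upper()
    return "".join(CODONS[dna[i:i + 3]] for i in range(0, len(dna), 3))

# First three 10-base groups and last 18 bases of the gene sequence above.
head = "ATGGCAGAAC TACCCCAAAG GTTTTCCGGT"
tail = "AGTGAGCCTGTATCTTAA"

print(translate(head))  # start of the protein
print(translate(tail))  # end of the protein, followed by the stop
```

The head fragment yields MAELPQRFSG, matching the first ten residues of the protein sequence, and the tail yields SEPVS followed by a stop, matching its last five residues.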