Gene Nther_1423 details


Gene Information

Locus tag: Nther_1423
Symbol:
ID: 6314552
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Natranaerobius thermophilus JW/NM-WN-LF
Kingdom: Bacteria
Replicon accession: NC_010718
Strand:
Start bp: 1492411
End bp: 1494279
Gene Length: 1869 bp
Protein Length: 622 aa
Translation table: 11
GC content: 37%
IMG OID: 642643803
Product: protein of unknown function DUF342
Protein accession: YP_001917594
Protein GI: 188586049
COG category: [L] Replication, recombination and repair
COG ID: [COG1315] Predicted polymerase, most proteins contain PALM domain, HD hydrolase domain and Zn-ribbon domain
TIGRFAM ID:
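
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the coordinates are 1-based and inclusive and that the unspliced CDS ends in a single stop codon; the function names are hypothetical and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(length_bp: int) -> int:
    """Residue count for an unspliced CDS that ends in one stop codon (assumed)."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1  # the final codon is the stop codon


length = gene_length_bp(1492411, 1494279)
print(length)                     # 1869
print(protein_length_aa(length))  # 622
```

Both values agree with the Gene Length and Protein Length fields listed in the record.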


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0000579847
Plasmid hitchhiking: Yes
Plasmid clonability: hitchhiker
 

Fosmid Coverage information

Num covering fosmid clones: 44
Fosmid unclonability p-value: 0.370296
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCAGAAC TACCCCAAAG GTTTTCCGGT GAAAACTTAC AAGAAGTTTT AGCGGAAGCT 
GCTGAATCAC TATCATGTGA AGTTGAGGAA CTAGAATATA AAGTGATACA GCGAGAAAAG
AAAGGCCTTT TAAGGCGAAC CCCCTGTGTC ATTGAAGTGT CTGGACAGCA TAAAAAAAAT
GACAACACAA ACACGGGTGA TAATAATGGT ATAGTGGCCG AAACGGCAGC TAGTAATGAC
CCTGCGGAAG AGAAACTAAA TGTCAGTATT GATGGATATT ATGAGATATC GGAAGAAGAC
AATGCTATTT ATTTAGTTGT ATACCCTCCT GAAAATCGAG GTAATTATGT CAAGTGGAAA
GATGTTAAGA GTAAATTAGA AGAAAAAGGT TTTGAAATCC TCGATGAGGC ATTTATAGTA
GAAATTGTCA GAAAATCGGA AGGCCAAAAA GTAGATATTT CTGAATATAT TGAGGAACAT
GTAATAGATG GATCTTTTGA AATTAGAGTT GCTGAAGACA ATATGAAAGC ACTGTTAAAG
GTGAATTTAC CTCAAGGAAG AGGAAAGGAA GTTAATTTAG AAGAAATTAC TCAGGCCCTA
AGTGAACGAA AAATCAGTCA AAATTTAGAT TTTCAAGCAA TACATAAATG TGTAAGTGAA
GGCACACAAG GCGAATTCAG AACTATTGCC ACCGGAGATC AGCCTATAGA TGGAAAAGAC
GCAGAGATCC AACTACATTT TGAAGAAAAA GAAAGAAAAC CTGTAGTTAA AGAAGACGGA
AGTGTGGACT ATTATAATAT TGATAATGTT ACTAATGTTA AAGCCGAGGA CCTCTTGGCG
AGCAAACATC CTCCAGAGGA GGGTAGTCCC GGTAAAGATG TATATGGAAA TATAGTGTCT
CCCAAACCAG GAACTGATCG GCAAATAAAA AGAGGTAAAA ATACCGAGTT AAGCGAAGAT
GAAATGGAGT TAAGAGCATC TATAGATGGA CAGGTAGTCA TGAATAATGA CGGGTTTATT
CACGTATATC CTGTTTATGA AGTTTCTGGT GATGTGGATG TTTCAACAGG AAATATTGAT
TTTGTGGGTA ATGTTATTGT AAAAGGACAG ATAAAAAGTG GTTTAAAGGT TAAGGCTGCT
GGGGATGTAG AAGTCCGTAA AAGTGTTGAT AGTTGTATAA TAGAAGCGGG AGGCAATGTC
GATATTAAAG GCGGCATTCA AGGTAGGAAC AAAGGGTCTA TTACTGCAGG TGGCTCGGTA
ACTTGCAAAT TCATCGAAAA TGCTCAAGTT TCTGCTGAAG GAGATATTAA TGTTATTGAA
GGTATTCTCC ATAGTCAGGT AGAAGGTAAT AAAATAAATG TTTTTGAAGG AAAAAAAGGT
TTACTCGTAG GTGGCAAAGT AACTGCAAGA GAAGAGGTAG TAGCTAAAAT GATTGGATCC
AGTTTTGCCA CTGCCACTCA TGTAGCTGTC GGCTTAGACC CTGAATTAAG GAAAAAGTCT
TCAGATATAG ATACAGAACT GAAAAACACC AACGAAAACC TGGAAAAAAC AGATAAAGCT
ATTGCAATAC TACAGAAGGT CAAGCAAACT AAAGGGGCGC TGCCTAAGGA TAAAGAAAAT
ATGCTTGTTA GATTGCAAAG GACTAAATCC CACTTAGACC AAACAAAACA GCAATTATGC
AGCCAAAAAG AGGAAATAAA AAATATTTTA AAAGATAAAA CAGATGGCAG AGTTATAGCA
AAAAAGGTGG TTTATCCTGG AGTCAAAGTA ACCATTGGTG AAGTCTCGTA TAATATAAAG
GATGAACAAA AGAGTAGTAT GTTTAGATTG TCCTCTGATG GAGAAGTTTC CAGTGAGCCT
GTATCTTAA
 
Protein sequence
MAELPQRFSG ENLQEVLAEA AESLSCEVEE LEYKVIQREK KGLLRRTPCV IEVSGQHKKN 
DNTNTGDNNG IVAETAASND PAEEKLNVSI DGYYEISEED NAIYLVVYPP ENRGNYVKWK
DVKSKLEEKG FEILDEAFIV EIVRKSEGQK VDISEYIEEH VIDGSFEIRV AEDNMKALLK
VNLPQGRGKE VNLEEITQAL SERKISQNLD FQAIHKCVSE GTQGEFRTIA TGDQPIDGKD
AEIQLHFEEK ERKPVVKEDG SVDYYNIDNV TNVKAEDLLA SKHPPEEGSP GKDVYGNIVS
PKPGTDRQIK RGKNTELSED EMELRASIDG QVVMNNDGFI HVYPVYEVSG DVDVSTGNID
FVGNVIVKGQ IKSGLKVKAA GDVEVRKSVD SCIIEAGGNV DIKGGIQGRN KGSITAGGSV
TCKFIENAQV SAEGDINVIE GILHSQVEGN KINVFEGKKG LLVGGKVTAR EEVVAKMIGS
SFATATHVAV GLDPELRKKS SDIDTELKNT NENLEKTDKA IAILQKVKQT KGALPKDKEN
MLVRLQRTKS HLDQTKQQLC SQKEEIKNIL KDKTDGRVIA KKVVYPGVKV TIGEVSYNIK
DEQKSSMFRL SSDGEVSSEP VS
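
The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch (not part of the original record), the Python below strips that layout, translates a DNA string, and computes a GC fraction. It assumes the codon-to-amino-acid assignments of translation table 11, which match the standard genetic code; the function names are hypothetical, and only the first line of the gene sequence is used as input.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
# Table 11 shares these assignments with the standard code; "*" marks a stop.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"

CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(text: str) -> str:
    """Remove the block spacing and line breaks used in the record."""
    return "".join(text.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        amino_acid = CODON_TABLE[dna[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First line of the gene sequence, copied from the record.
first_line = "ATGGCAGAAC TACCCCAAAG GTTTTCCGGT GAAAACTTAC AAGAAGTTTT AGCGGAAGCT"
print(translate(clean_sequence(first_line)))  # MAELPQRFSGENLQEVLAEA
```

The printed residues match the first two blocks of the protein sequence. Passing the full gene sequence through `clean_sequence` and `gc_fraction` would give a value to compare against the GC content field.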