Gene Tneu_1444 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTneu_1444 
Symbol 
ID6165055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameThermoproteus neutrophilus V24Sta 
KingdomArchaea 
Replicon accessionNC_010525 
Strand
Start bp1280729 
End bp1283116 
Gene Length2388 bp 
Protein Length795 aa 
Translation table11 
GC content58% 
IMG OID641668602 
ProductSMC domain-containing protein 
Protein accessionYP_001794815 
Protein GI171185896 
COG category[L] Replication, recombination and repair 
COG ID[COG0419] ATPase involved in DNA repair 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.353031 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones50 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTGGAGGA TAGAGAGGAT TGAGTTGGAG AACTTCAGAT CCTACAAAGG CAGACATGAG 
CTACGCATAG GCGACGCCGC GGTCTTCTGG GGCAGAATAG GCGCTGGCAA GACATCGATA
CTGTACGCAA TAGAGTACGC TTTGTTTGGT AGACAGCTTG AGGTGAAGGA AAGAGTTGCG
AGGCTTGTCG ATCTAATCAA CGCAGATGCG CAAGAGGCTA GGGTGGCGCT GGAGTTGCGG
AAAGGCGGCG AGTTGCTGCG CGTTGAGCGA AGGCTGGGGA GGAGAGGCGG CGAGAGGGTT
GTCCTGCACT ACGGCGGCGT AGAACACCGG GGGGATCAAG CCGAGGAGAA GCTGGCGGAG
CTCCTGGGGG CTGACGAAGA CGTCTACGAG CGCCTAGTCT ACATATCGCA TAGGACGCTT
GAGGGCTTCA TATACGGCAC AACGCAGAAG AGGGCTCTCT CGGTGGACCG TCTCTTCGGT
ATAGACGTCG TAGACGGCGT GATCAAGGCC GTGTCTACAT ATGAGAAAGC CCTGCTCAAC
GTTGCGGAGG ATCTCAGGAA GAGGCTTGCG TCGTATGAGA AATACAGGGA GGTCATCAAG
CGGTATGGGG GCTACAGAGG CGTAGAGGCT AGGCTGGGGG CTGTTGAGCA AGAGCTTGAG
GCGTTGAAGA AGAGGGAGGA GGCGCTTTCG GCCGAGGCCG CTGACCTCGC GAAGAGGAGG
ACCGCCTACC TGGAGAAGAT CAGAGAAAAC GAGGGCCTTC TCCTGGAGTA CTACAGAGCG
AAGTCGGAGC TTGAGGTCTT GGAGTCAGAT GCCGGCGGAG ATGTCGACGT ATCTGCGGTG
GAGAAGATAA GGGAGGCGCT TCGGGAGGCA CTAGAGGAAT ACGAACATCT GTTGGGCCGC
GAGCTTGCCG AGAGGCTGGA AAAGGCGGCA GATTTAGAAT CTCTATCCAC GGCCATGTCC
GAGGCCTACG AGGCGCTCCT CAAGCTGGCG GGCGAGTTGG AGGCGCAGAT CTCGGAGACT
AAGAAGACGT ATGAACAATT GCTGGCTAGG GCGAAGCGGC TTGACGAAGA GGTGCGGGAC
GCCGAGGCTA AGCTCAAGAG GTTGGAGAAG TCGTACAACC GCTTTAAGGA GCTTCAGCGG
TCTTTCCAGT CTCTAGACGC CGCAAGAGCG GCCCTCGCTG AGGTGAGGAA GCGGCTTGAC
GAAGTGGAGA GATCCGTGGC GTTTCACTCG GCGTTGAGAA CAGTGGCTTT GTACACGGCG
GAGACTGGGG CTGAGCGGTG TCCGCTGTGT GGCTCGCCGA TAGATCGCAG AGCCGCGCTA
CGGGTTGCTG ATGAGGTAGA GGGGAAATTC GGCGAGCTTA TCCGGGAGGC GGAGGCCCTC
AGGGAAAAGG TAAGAGAGCT TGAGAAGGCC GTAGATGAGA TGGACGCGTT AAGCGGCGAA
GTCGCGGAGT ACCTAGCCAC GAAGGCAAAA GCGCAGGAGC TCGCGCTTGA GAGGGAGGAG
GTGGTCAAGA GGGTTCTCCA GGCGGAGAAG TCTGTGAAAC AGCTGGAGAA GAAGGCAGAG
AGGCTCCGGG CCCTCCTCGC GAGGATAGAT AAGCGCATTA TAGCAGAGGC CGTGGCTAAA
TACGGAAGAG CGCTGAGGGC TAGGGAGCTC CGCCGTAGAG TTAAAGAGCT GGAGGAGGCG
TTGGCAAAGG CGGGGGTGGG GGGCGAAACT CTCGACGTGG AAATGCGGTG GAGAGAGGTT
GTGGCCGACT TAGAAAAAAC GTCGAGCCGC ATGGCGGAGC TTCACAGGGA GAAGACGAGC
CTGGAGGAGG TGGTGAGGGA GGTAGGCGGC GACGCCGAAT CGCTGAAGAA GAAGCTGGAC
TCCGCCCTAT ACGCCTACGG CAGACTCCAG GAGCTGAAGG CCAAGCTGGA GCTGGCTAAG
GTAAACGCCA GGGCTAGGTT GCTGGAGGTG GCAAGATCGA AGTTCAACGA GGTTTTTCTC
TCGCTCTACA AATACGGAGA TTTAGTCGGC GTCAACGCCG ACTTAGAACA GCGCAGGGGG
TACTACGAGT TCTACGCGGT GACGCCCACG GGGGAGCGAT ACGGCATTTC AAAACTGAGC
GACGGGCAGA GGCTCTCCAT CGCCCTCGCC CTAGCCCTGG CCCTCCGTGA TGTCTCGAAC
ATCCACATAG GCTTTGTGAT CTTCGACGAG CCGATACCAT ACGTAGACGT CAACATAAGG
AAGGCCTTCG TAGATCTGGT ACACGCTCTA TCAAACAAGT ACCAGATAAT CGTCGCGACG
CAATCCAGGG AGTTCGCAGA TCTAATAAAA AACGCGCTAA ACGCCAAGCT CTACTCGGTG
GCGAAGAGGG AGAGCTCCGA GGTGAGGGAA GAGAGAGATC AGAGCTAA
 
Protein sequence
MWRIERIELE NFRSYKGRHE LRIGDAAVFW GRIGAGKTSI LYAIEYALFG RQLEVKERVA 
RLVDLINADA QEARVALELR KGGELLRVER RLGRRGGERV VLHYGGVEHR GDQAEEKLAE
LLGADEDVYE RLVYISHRTL EGFIYGTTQK RALSVDRLFG IDVVDGVIKA VSTYEKALLN
VAEDLRKRLA SYEKYREVIK RYGGYRGVEA RLGAVEQELE ALKKREEALS AEAADLAKRR
TAYLEKIREN EGLLLEYYRA KSELEVLESD AGGDVDVSAV EKIREALREA LEEYEHLLGR
ELAERLEKAA DLESLSTAMS EAYEALLKLA GELEAQISET KKTYEQLLAR AKRLDEEVRD
AEAKLKRLEK SYNRFKELQR SFQSLDAARA ALAEVRKRLD EVERSVAFHS ALRTVALYTA
ETGAERCPLC GSPIDRRAAL RVADEVEGKF GELIREAEAL REKVRELEKA VDEMDALSGE
VAEYLATKAK AQELALEREE VVKRVLQAEK SVKQLEKKAE RLRALLARID KRIIAEAVAK
YGRALRAREL RRRVKELEEA LAKAGVGGET LDVEMRWREV VADLEKTSSR MAELHREKTS
LEEVVREVGG DAESLKKKLD SALYAYGRLQ ELKAKLELAK VNARARLLEV ARSKFNEVFL
SLYKYGDLVG VNADLEQRRG YYEFYAVTPT GERYGISKLS DGQRLSIALA LALALRDVSN
IHIGFVIFDE PIPYVDVNIR KAFVDLVHAL SNKYQIIVAT QSREFADLIK NALNAKLYSV
AKRESSEVRE ERDQS