Gene Cthe_2371 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagCthe_2371 
SymboldnaA 
ID4809009 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameClostridium thermocellum ATCC 27405 
KingdomBacteria 
Replicon accessionNC_009012 
Strand
Start bp2834293 
End bp2835624 
Gene Length1332 bp 
Protein Length443 aa 
Translation table11 
GC content42% 
IMG OID640107782 
Productchromosomal replication initiation protein 
Protein accessionYP_001038766 
Protein GI125974856 
COG category[L] Replication, recombination and repair 
COG ID[COG0593] ATPase involved in DNA replication initiation 
TIGRFAM ID[TIGR00362] chromosomal replication initiator protein DnaA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0166544 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGAATACTC AGTTGAATGA AATATGGCAA AAAACTTTAG GACTGCTTAA AAATGAGCTT 
ACAGAAATCA GTTTTAACAC CTGGATCAAG ACCATCGATC CATTGTCCTT GACAGGCAAT
ACTATAAACC TGGCTGTCCC GGCGGAATTC AACAAGGGAA TTCTTGAGTC CAGGTATCAA
ACTCTGATTA AAAATGCCAT TAAGCAAGTT ACTTTTAAGG AATACGAGAT TGCATTTATC
GTGCCTTCAC AGGAAAATTT AAACAAGCTG ACGAAGCAGA CCGAGTCCGC CGGCAATGAG
GATTCTCCTT TGTCAGTATT AAACCCGAAG TACACGTTTG ACACTTTTGT CATAGGAAAC
AGCAACAGAT TTGCACACGC AGCCGCACTG GCCGTGGCCG AGGCACCGGG AAAAGCATAC
AATCCCTTGT TCATATATGG CGGAGTGGGA CTTGGGAAGA CTCATCTTAT GCATGCCATC
GGGCACTACA TTCTGGAACA GAATTCTTCC CAAAAGGTTT TGTATGTTTC ATCTGAAAAA
TTTACCAACG AACTTATCAA TGCCATTAAA GACAACAGAA ATGAAGAATT CAGATCCAAA
TACAGAAATA TTGACGTACT GCTTATAGAC GACATACAAT TCATTGCCGG AAAGGAAAGA
ACGGAGGAGG AGTTCTTCCA TACCTTCAAT GCTCTTTACG AAGCAAACAA ACAGATAATC
CTGTCAAGCG ACAAGCCTCC GAAAGAAATT TCTCTTGAGG ACCGCCTGAG ATCCAGGTTT
GAATGGGGCT TGATTGCGGA CATGCAGGCA CCGGATCTGG AAACCAGGAT AGCAATACTA
AGGAAAAAAG CCCAGCTTGA AAACCTTACT GTTCCAAATG AAGTAATTGT ATTCATTGCA
GACAAGATAG CATCAAACAT CAGAGAACTT GAAGGTGCCT TAAACAGAGT AATAGCATAT
TCATCGCTTA CGGAAAACGA AATTACCGTC GAACTCGCCA GCGAAGCATT AAAAGACATA
CTGTCAGCAA ACAAGGCGAA AGTTTTAAAC TGCACCACAA TCCAGGAAGC AGTGGCCAGA
TACTTTGACA TAAGACCGGA AGAATTTAAA TCAAAGAAGA GGACAAGGGA CATCGCATTC
CCAAGACAAA TTGCAATGTA CCTGTGCAGA GAACTTACCG AAATGTCCCT CCCAAAAATC
GGCGAGGAAT TCGGCGGAAG AGATCATACT ACTGTAATAC ATGCATGTGA AAAGATAAGT
GAAGAAATCG AAAGCAACTC CGAAACCAGG AGGGCCGTAA GTGAAATAAA GAGGAACCTG
CTGGGAAAAT AA
 
Protein sequence
MNTQLNEIWQ KTLGLLKNEL TEISFNTWIK TIDPLSLTGN TINLAVPAEF NKGILESRYQ 
TLIKNAIKQV TFKEYEIAFI VPSQENLNKL TKQTESAGNE DSPLSVLNPK YTFDTFVIGN
SNRFAHAAAL AVAEAPGKAY NPLFIYGGVG LGKTHLMHAI GHYILEQNSS QKVLYVSSEK
FTNELINAIK DNRNEEFRSK YRNIDVLLID DIQFIAGKER TEEEFFHTFN ALYEANKQII
LSSDKPPKEI SLEDRLRSRF EWGLIADMQA PDLETRIAIL RKKAQLENLT VPNEVIVFIA
DKIASNIREL EGALNRVIAY SSLTENEITV ELASEALKDI LSANKAKVLN CTTIQEAVAR
YFDIRPEEFK SKKRTRDIAF PRQIAMYLCR ELTEMSLPKI GEEFGGRDHT TVIHACEKIS
EEIESNSETR RAVSEIKRNL LGK