Gene Moth_0002 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagMoth_0002 
SymboldnaA 
ID3831312 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameMoorella thermoacetica ATCC 39073 
KingdomBacteria 
Replicon accessionNC_007644 
Strand
Start bp450 
End bp1778 
Gene Length1329 bp 
Protein Length442 aa 
Translation table11 
GC content49% 
IMG OID637827929 
Productchromosomal replication initiation protein 
Protein accessionYP_428885 
Protein GI83588876 
COG category[L] Replication, recombination and repair 
COG ID[COG0593] ATPase involved in DNA replication initiation 
TIGRFAM ID[TIGR00362] chromosomal replication initiator protein DnaA 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.0135977 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0000101386 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
TTGGCTCCTG TACAGCTCGA TCTGGCCTGG CAACAGGCCT TAACGATTCT CGAAAAACAG 
GTCAGCACCC CTGCCCTGGA AACATGGTTT TACGAAGCCC GGCCGGTTAC CATGCAGGGC
AATACCCTGG TCCTGGCTGT AGCCAACGAA TTTGCCCGGG ACTACATCCA GAGCCGCTAT
TATCCCCTTA TCCAGGAAGC ACTGCAACAG GTCCTGGGGC GAAAAATTAT CAAAATCCAG
GTAATCTGCT TCCCTCTCTC AAGTTCAAAC CAGCGCCAGG AACCGGAACT GGAGGATCCC
AGTCTCCCCC CCCTCAACCC TAAATATACC TTTGAAACCT TTGTTGTGGG TAACAGCAAC
CGTTTTGCCC ATGCAGCCTG CCTGGCCGTA GCCGAATCGC CAGCCAGTTC TTATAATCCG
CTATTTATCT ACGGGGGCGT AGGTCTTGGC AAAACCCACC TGATGCAGGC CATCGGCCAT
CGGGTTCGCC AGCATTTACC AGAACTCCGG GTAATGTACA TCTCTTCGGA AAAATTCACG
AACGACTTGA TTAATGCTAT TAAGGATAAG GCTACAGAAC AGTTCCGCAC CAAGTATCGC
AATATCGATG TTTTATTAAT TGATGATATC CAATTTTTAG CAAAAAAAGA GAGTACCCAG
GAAGAGTTCT TCCATACTTT TAATCATTTA TATGAGGCAA ATAAACAAAT AATCATCTCC
AGTGACCGGC CGCCCAAGGA AATTCCCACC CTGGAAGACC GCCTGCGTTC CCGCTTCGAG
TGGGGCCTGA TCACCGATAT CCAACCGCCT GATCTGGAAA CCAGGATGGC TATTTTACGC
AAAAAAGCTG TTGCCGAGGG TATTAACCTG CCGGATGAAG TCATGTTCTT TATAGCTCAA
AAAATTGATT CTAACATTCG AGAGCTGGAG GGGGCCCTCA TCCGGGTTGC TGCTTACGCC
AATTTTACCA AAAAAGAAAT AACCCCCGGG CTGGCAGAAG AGATTTTAAA AGACGTTCTC
GACCTGGCGC GACCTAAACC GATTACCATT CGTTTAATCC AGGAGACAGT AGCTAATTAC
TTCAATTTGA AGGTAGAAGA TCTAAAAGCC AAGAAGCGCA CGCGTTCCGT GGCTTACCCC
CGTCAAATTG CCATGTACCT CTGTCGGGAA CTAACCGAAT CCTCCCTGCC GGATATCGGT
AAGGAATTTG GCGGCCGGGA TCATACTACT GTTCTCCACG CCTACGACAA GATTCGCGAC
GACCTAAACA CAGATCCTTC CCTTCCCCAG GTAATAGCCC AGATAAGGCA ACAGCTTAGA
AACCAGTAA
 
Protein sequence
MAPVQLDLAW QQALTILEKQ VSTPALETWF YEARPVTMQG NTLVLAVANE FARDYIQSRY 
YPLIQEALQQ VLGRKIIKIQ VICFPLSSSN QRQEPELEDP SLPPLNPKYT FETFVVGNSN
RFAHAACLAV AESPASSYNP LFIYGGVGLG KTHLMQAIGH RVRQHLPELR VMYISSEKFT
NDLINAIKDK ATEQFRTKYR NIDVLLIDDI QFLAKKESTQ EEFFHTFNHL YEANKQIIIS
SDRPPKEIPT LEDRLRSRFE WGLITDIQPP DLETRMAILR KKAVAEGINL PDEVMFFIAQ
KIDSNIRELE GALIRVAAYA NFTKKEITPG LAEEILKDVL DLARPKPITI RLIQETVANY
FNLKVEDLKA KKRTRSVAYP RQIAMYLCRE LTESSLPDIG KEFGGRDHTT VLHAYDKIRD
DLNTDPSLPQ VIAQIRQQLR NQ