Gene GM21_0001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagGM21_0001 
SymboldnaA 
ID8139527 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameGeobacter sp. M21 
KingdomBacteria 
Replicon accessionNC_012918 
Strand
Start bp119 
End bp1501 
Gene Length1383 bp 
Protein Length460 aa 
Translation table11 
GC content54% 
IMG OID644867618 
Productchromosomal replication initiation protein 
Protein accessionYP_003019846 
Protein GI253698657 
COG category[L] Replication, recombination and repair 
COG ID[COG0593] ATPase involved in DNA replication initiation 
TIGRFAM ID[TIGR00362] chromosomal replication initiator protein DnaA 


Plasmid Coverage information

Num covering plasmid clonesn/a 
Plasmid unclonability p-valuen/a 
Plasmid hitchhikingn/a 
Plasmid clonabilityn/a 
 

Fosmid Coverage information

Num covering fosmid clones82 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAAATA TTTGGCTGGA AGCCCAGACA AATCTTAAGC AAGTATTAAC CGAACAGACA 
TACAGTACGT GGATCGACCC GTTGAAGTTC CTGGGTGCCA CAGTTGACAC CATAGTCCTC
GAAGTCCCCA GTTCGTTCTT TCAAAAATGG GTCACTGACA AATATCTGGC AATGATCAAG
GAAGCCATCT CCGCGGTCAA CGGCAAAAGC TACCAGATAG AGTTCCATGT CGCCGATGAG
AAGCCGGAGG CGGCTCCCGA GGAAAAGCCC GAAAAAGAGG GGAAACCTGC CAGGGAGAAA
GAAAAGGATA AGGACAAGGA AAAAGAGAAG GATAGAGAAA AGGAGAAGGA CAAGAAGGAG
CTGGTTCCCA ATCTGAACCC CAAGTACACC TTCGAGTCTT TCGTCTCGGG TCCCAGCAAC
CAGTTCGCTT ATGCAGCTTC CCAGGCGGTG GCGAACAAGC CGGCCACCAA TTACAACCCG
CTCTTCATCT ACGGCGGGGT GGGCCTCGGC AAGACGCACC TGGTCAACGC CATCGGCAAC
CATATCCTGG CCAAGAACCC GAAGGCGAAG ATCTGCTACT ACTCCTCAGA GAAGTTCATG
AACGAGATGA TCAACTCGCT CCGATACAAG AAGATGGACG AGTTCCGCAA CAAGTTCAGG
AAAATGGACC TGCTGCTCAT CGACGACATA CAGTTCATGG CCGGAAAAGA GGCGACGCAG
GAAGAGTTCT TCCACACCTT CAACGCGCTC TACGAGTCGC ACAAGCAGAT CGTGGTCACC
TCCGACAAGT TTCCCAAGGA CATCCCGGGG CTAGAGGAGC GGTTGAGAAG CCGTTTCGAA
TGGGGGCTGA TCGCCGACAT ACAGCCGCCG GGGGTGGAGA CCAAGGTCGC CATTCTCAAG
AAGAAGTCCG ACATGCACGC GGTCAACCTC CCCGACGACG TGGCGCTCTT TCTCGCGGAA
GGTGCGAACA GCAACATCCG CGAGCTGGAG GGGATGCTGA TCAGGCTGGA GGCGTTTGCA
AGCCTCACCG GTCAGGAGAT AACGCTCAGC ATGGCCCGCG AGGTGATGAA GGACATCATC
GTCGAGAAGA CACGCGACAT CACCGTCGAG ATGATACAGA AGACCGTTGC GGAGCATTTC
CGCATCAAGG TGTCGGAGCT TAAGTCGGAC AAAAGGATCA AGACCCTCGT GGTTCCGCGC
CAGATAGCGA TCTACATCTG CCGCGAGCTC ACCAAGGCGT CCTACCCGGA AATAGGCGAG
AAGTTCGGCG GGAAGGACCA CTCCACCATC ATCCATTCGG TGAAGAAGAT AGAAAAGCAG
ATGGCGGGCG ACGATGAGTT TAAGGCGTCT GTGGAAGACA TAAGGAAAAA GCTGTTCACT
TAA
 
Protein sequence
MENIWLEAQT NLKQVLTEQT YSTWIDPLKF LGATVDTIVL EVPSSFFQKW VTDKYLAMIK 
EAISAVNGKS YQIEFHVADE KPEAAPEEKP EKEGKPAREK EKDKDKEKEK DREKEKDKKE
LVPNLNPKYT FESFVSGPSN QFAYAASQAV ANKPATNYNP LFIYGGVGLG KTHLVNAIGN
HILAKNPKAK ICYYSSEKFM NEMINSLRYK KMDEFRNKFR KMDLLLIDDI QFMAGKEATQ
EEFFHTFNAL YESHKQIVVT SDKFPKDIPG LEERLRSRFE WGLIADIQPP GVETKVAILK
KKSDMHAVNL PDDVALFLAE GANSNIRELE GMLIRLEAFA SLTGQEITLS MAREVMKDII
VEKTRDITVE MIQKTVAEHF RIKVSELKSD KRIKTLVVPR QIAIYICREL TKASYPEIGE
KFGGKDHSTI IHSVKKIEKQ MAGDDEFKAS VEDIRKKLFT