Gene Acid345_0001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagAcid345_0001 
Symbol 
ID4070011 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameCandidatus Koribacter versatilis Ellin345 
KingdomBacteria 
Replicon accessionNC_008009 
Strand
Start bp33 
End bp1418 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content57% 
IMG OID637982001 
Productchromosomal replication initiator protein DnaA 
Protein accessionYP_589080 
Protein GI94967032 
COG category[L] Replication, recombination and repair 
COG ID[COG0593] ATPase involved in DNA replication initiation 
TIGRFAM ID[TIGR00362] chromosomal replication initiator protein DnaA 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0000000704659 
Plasmid hitchhikingNo 
Plasmid clonabilityunclonable 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0525005 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCTCT CGACCACGAC TCCACCAGCT CCGAACCCTT GGCTTCAGGT TCTGGATGCG 
CTGGAGAAGA AAGTAAATCG CCTCTCGTAC GACACCTGGC TCAAACCCAC ACGGTTCAGT
CACTCACAGG GAAACAAGAT CTTCGTTCGC GTGCCGACGC CGGAGTTCCG CCACATTGGC
GAAAAGTACG GCGACCTGAT CCAGGAAGCG CTCGACAGTC TTGGCCTCGG CTTTGAAGAC
GTCGAGTTCG TGACCGATCA AGCGCCCGCG GAAGCACCGA TGCGGCACGA CGGAGGATTT
CCGCCACAGT CAGCGGCAGC CCCTGCGCCG GCGCGAGTTC AACAAGCGCG CTTCGACTGG
GACGGTGCGG CACAGCTGAA TCCCAAGTAC ATCTTCGATA ACTTCGTTAC CGGCCCCGGC
AACCAGTTTG CACACGCTGC GTCGCGTGCA GTGGCGGACC GGCCTTCGAA GACCTACAAC
CCGCTGTTCC TGTACGGCGG CGTGGGAATG GGAAAGACGC ACTTGATGCA GGCCATCGGC
CATACCATCA AGCGTAACAA CCCTGAGCAT TCCATCTGCT ACGTGTCGAG TGAGAAGTTC
ACGAACGACA TGATCAACTC CGTCCGCTAC GACAAGATGA CAAGCTTCCG CGAGCGCTAC
CGGACGGTGG ATGTGCTACT GATTGACGAT ATCCAGTTCA TTGCACGCAA GGAGCGGACC
CAGGAAGAAT TCTTCCATAC GTTCAACGCG TTGCACGAGC AGCAGAAGCA GTTGGTGATT
GCCAGTGACC GTCCGCCGAA AGAGTTGGCA GAGATCGAAG ATCGTCTACG TAGCCGTTTC
GAGTGGGGCC TGATTGCCGA TATTCAGCCT CCCGACCTTG AGACGAAGGT TGCAATCCTT
CAGAAGAAGG CGGAGTCCGA ACGGACGCAG TTGCCCACGG ATGTGGCTCT GTTCATTGCG
TCGAACATTC GCAGCAACGT GCGAGAGTTG GAAGGCGCCT TAATCCGTCT GGTAGCGTAC
TCGTCTCTTA CGGGTGGCGA GTTGAACCTG ATGACCGCGC AGCAGGTGCT GAAGAACATC
ATTGATCAGC AGACGCGGAA GGTGACGATC GAGTCGATTC AGAAGGCCTG CGCTGAGCAG
TTTGGCCTGC GGATCGCCGA AATTAAGGCG AAAAACAATT CGCGGGCGAT TGTGTATCCG
CGCCAAATTG CGATGTACCT GGCGAAGCAC CTGACGGAAG CATCGTTGCC CGAGATTGGG
CGGCAGTTTG GCGGCAAGCA TCATACGACG GTGCTGCACT CGGTGGACAA GATCGACGAG
GCCCGGAAAT CGGACAAAGA TTTGAACAGG CTCCTCAATA AACTTACCGA GAGTCTAAGC
GGATAA
 
Protein sequence
MSLSTTTPPA PNPWLQVLDA LEKKVNRLSY DTWLKPTRFS HSQGNKIFVR VPTPEFRHIG 
EKYGDLIQEA LDSLGLGFED VEFVTDQAPA EAPMRHDGGF PPQSAAAPAP ARVQQARFDW
DGAAQLNPKY IFDNFVTGPG NQFAHAASRA VADRPSKTYN PLFLYGGVGM GKTHLMQAIG
HTIKRNNPEH SICYVSSEKF TNDMINSVRY DKMTSFRERY RTVDVLLIDD IQFIARKERT
QEEFFHTFNA LHEQQKQLVI ASDRPPKELA EIEDRLRSRF EWGLIADIQP PDLETKVAIL
QKKAESERTQ LPTDVALFIA SNIRSNVREL EGALIRLVAY SSLTGGELNL MTAQQVLKNI
IDQQTRKVTI ESIQKACAEQ FGLRIAEIKA KNNSRAIVYP RQIAMYLAKH LTEASLPEIG
RQFGGKHHTT VLHSVDKIDE ARKSDKDLNR LLNKLTESLS G