Gene Saro_0001 details

Gene Information

Locus tag: Saro_0001
Symbol: dnaA
ID: 3917659
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Novosphingobium aromaticivorans DSM 12444
Kingdom: Bacteria
Replicon accession: NC_007794
Strand:
Start bp: 24
End bp: 1520
Gene Length: 1497 bp
Protein Length: 498 aa
Translation table: 11
GC content: 66%
IMG OID: 640442726
Product: chromosomal replication initiation protein
Protein accession: YP_495284
Protein GI: 87198027
COG category: [L] Replication, recombination and repair
COG ID: [COG0593] ATPase involved in DNA replication initiation
TIGRFAM ID: [TIGR00362] chromosomal replication initiator protein DnaA
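
The Start bp, End bp, Gene Length and Protein Length values above are mutually consistent if the coordinates are read as 1-based and inclusive and the final codon is a stop codon. Both of those are assumptions here, not something the record states. A minimal sketch of the arithmetic, with hypothetical helper names:

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the last codon is a stop."""
    codons, remainder = divmod(gene_len_bp, 3)
    if remainder:
        raise ValueError("CDS length is not a multiple of 3")
    return codons - 1  # the stop codon is not translated


length = gene_length(24, 1520)
print(length, protein_length(length))  # prints "1497 498"
```

The record's own sequence is consistent with the stop-codon assumption: the gene sequence ends in TGA.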


Plasmid Coverage information

Num covering plasmid clones: 28
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: n/a
Fosmid unclonability p-value: n/a
Fosmid Hitchhiker: n/a
Fosmid clonability: n/a
 

Sequence

Gene sequence
ATGGAAGAAT TCGCAGGGGG AGCCTTGGCC CGTGGCGTGA ACAGCGCTGC CAATTCAGGC 
GGCGAGCGCG CACAGGCAGG CAAGAAGGGA CGTGAAAGCA ACATGATCGA AGACCAGGAA
GCGCTGGATC TGGCTGCCGA CTGGGCCGAC ATCAGCCAGG GCCTCAAGAA GGACCTGGGG
CCGCAACTCC ACGCCCAGTG GATCAAGCCG ATCCAGCTCG GCTCGTTCTG CAAGGAAACC
GGCACGCTCG ACCTGTTCCT GCCGACCGAA TTCTCCGCCA ACTGGGTTGC AGATCGCTTT
GCCGATCGCC TCAGCCTGGC GTGGAAGATT GCCCGCTCGG AAGTGCGGCA GGTGCGCATC
ACCGTTCACC CACGCCGTCG TTCGATGCCG GAACTGCGCG TCGGCGCGGC GCCGACCGCC
CCGAGCCGCG CCGCGACGCT CGCCGCCACG TCGCCCGTGA TGGTCGATTC GGCGCTTTCG
GGCCTCGACC CCTCGCTGAC CTTCGCGGAA TTCGTCTCGG GTTCGGCCAA CGTGCTGGCG
GTCAACGCCG CGCAGCGCAT GGCAGCCATC GAGGCGCCGC AGTTCTCGCC GCTCTACCTC
AAGGGCTCGA CCGGCCAGGG CAAGACCCAC CTGCTTCACG CCATCGGCCA CGCCTTTGCC
GCCAACAAGC CGGGCGCCCG GATCTTCTAC TGCTCGGCCG AACGGTTCAT GATCGAATTC
GTCCAGGCCA TGCGCTCGAA CGAGATGATC GAGTTCAAGT CTCGGCTGCG CGGATTCGAC
ATGCTGCTGG TCGACGACAT CCAGTTCATC ATCGGCAAGG CTTCGACGCA GGAGGAATTC
CTCCACACGA TCGACGCGCT GATGAGCGCG GGCAAGCGAC TCGTCGTTGC CGCCGACCGT
GCGCCCCAGG CGCTCGACGG GGTCGAACAG CGCCTGCTCT CGCGCCTGTC GATGGGCCTC
GTTGCCGATA TCCAGCCAGC CGACATCGAA CTGCGCCGCA AAATCCTCGA ACACCGCCTC
GCCCGCTTCG GCAACACGCA GGTGCCTTCG GACGTGGTCG AGTTCCTCGC CCGCACGATC
AACCGCAACG TGCGCGAACT GGTGGGCGGG CTCAACAAGC TGATCGCCTA TGCCCAGCTC
ACCGGCCAGC CGGTCTCGCT GCAACTGGCG GAAGAACAGT TGACCGATAT CCTCTCGGCC
AACCGCCGCC GCATCACCAT CGACGAGATC CAGCGCACGG TCTGCCAGTT CTACCGCGTC
GACCGCACCG AAATGGCCAG CAAGCGCCGC GCCCGCGCGG TCGTTCGTCC GCGCCAGGTG
GCGATGTACC TTGCCAAGGT GCTGACGCCG CGCTCTTATC CGGAGATCGG CCGCAAGTTC
GGCGGTCGCG ACCACTCCAC CGTGATCCAC GCGGTCCGTC TGATCGAGGA ACTGCGCGCC
CGCGATGCCG ACATGGACGG CGACGTGCGC ACGCTCCTGC GCCAGCTTGA GGACTGA
 
Protein sequence
MEEFAGGALA RGVNSAANSG GERAQAGKKG RESNMIEDQE ALDLAADWAD ISQGLKKDLG 
PQLHAQWIKP IQLGSFCKET GTLDLFLPTE FSANWVADRF ADRLSLAWKI ARSEVRQVRI
TVHPRRRSMP ELRVGAAPTA PSRAATLAAT SPVMVDSALS GLDPSLTFAE FVSGSANVLA
VNAAQRMAAI EAPQFSPLYL KGSTGQGKTH LLHAIGHAFA ANKPGARIFY CSAERFMIEF
VQAMRSNEMI EFKSRLRGFD MLLVDDIQFI IGKASTQEEF LHTIDALMSA GKRLVVAADR
APQALDGVEQ RLLSRLSMGL VADIQPADIE LRRKILEHRL ARFGNTQVPS DVVEFLARTI
NRNVRELVGG LNKLIAYAQL TGQPVSLQLA EEQLTDILSA NRRRITIDEI QRTVCQFYRV
DRTEMASKRR ARAVVRPRQV AMYLAKVLTP RSYPEIGRKF GGRDHSTVIH AVRLIEELRA
RDADMDGDVR TLLRQLED
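
The gene sequence above is displayed in space-separated blocks of 10 bases, so the spacing has to be stripped before any computation on it. As a rough sketch of how a GC content figure like the one in the Gene Information table might be derived (the helper name is hypothetical, and the exact rounding used by the source database is not known):

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring whitespace and letter case."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for base in bases if base in "GC")
    return 100.0 * gc / len(bases)


# First displayed row of the gene sequence, copied with its block spacing.
first_row = "ATGGAAGAAT TCGCAGGGGG AGCCTTGGCC CGTGGCGTGA ACAGCGCTGC CAATTCAGGC"
print(round(gc_percent(first_row)))
```

Passing the full gene sequence, line breaks included, works the same way because all whitespace is removed first.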