Gene Noca_2445 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_2445 
Symbol 
ID4599786 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp2607448 
End bp2608428 
Gene Length981 bp 
Protein Length326 aa 
Translation table11 
GC content71% 
IMG OID639777047 
Productputative DNA ligase (ATP), C-terminal 
Protein accessionYP_923636 
Protein GI119716671 
COG category[L] Replication, recombination and repair 
COG ID[COG3285] Predicted eukaryotic-type DNA primase 
TIGRFAM ID[TIGR02776] DNA ligase D
[TIGR02778] DNA polymerase LigD, polymerase domain 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value0.858354 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGCCCGCCT CGAAGACGCC CGCGGTCGAG ATCGAGGTCG ACGACCGGGT CGTGCGGATC 
AGCAACCCCG ACCGGGTGTA CTTCCCCGAC AGCGGAGCGA CCAAGCTCGA CCTGGTCGAG
TACTACCTCG CGGTCGGTCC GGGCATCGTC AACGCCCTGT TCGAGCGGCC GTGCATGCTG
CACCGCTTCC CGAAGGGCCT GGCGGGGGAG AAGGTGCACC AGAAACGGCT GCCGAAGGGC
GCACCGCCCT GGGTGGAGAC GGTCCGCCTG CACTTCCCGC GCTGGAACCG TACGGCGGAC
GAGCTCTGCG TCACCGAGCT GGGCAGCGTG ATCTGGGCGG TGCAGATGTC CACGGTCGAG
TTCCATCCCT GGAACAGCCG CCGCGAGGAC ACCGAGCGGC CGGACGAGTG GCGCATCGAC
CTCGACCCCG GCCCGGAGTG CTCCTACGAC CGGGTCCGGC GGGTGGCCCA CGTCGCCCAC
GAGGTCCTCG ACGAGCTCGG CGTCGTCGGC TTCCCGAAGA CCAGTGGCAG CAAGGGACTG
CACGTGTACG TCCGGATCCG CCCGGACCAT GGGTTCAAGG AGGTACGCCG GGCCGCGCTC
GCCTTCGCCC GCGAGGTCGA GCGCCGGGCG CCCGAGGACG TGGACCTGAC CTGGTGGCGC
AAGGACCGGG ACCCCGCGAC GGTCTTCGTC GACTACAACC AGAACGCCCG TGACCACACC
ATCGCGGCGG CGTACTCCGT CCGCGGCCTC CCCGACGCCC GGGTCTCCAC GCCCATCCGG
TGGGACGAGG TCGACGACGC CGACCCGCGT GACTTCACGA TCTTCACGGT GCCCGAGCGG
TTCGCCCGGC TCGGCGACCT GCACACCGAT ATCGACGACG GCGGCGGCCG GGGGCCGTTC
GACATCGCGT CGCTCCTGGA GTGGGCCGAC CGCGACGAGC GCGACGGCGC CGCCGGTCCC
GACGACGAGC AGTCCGAATA G
 
Protein sequence
MPASKTPAVE IEVDDRVVRI SNPDRVYFPD SGATKLDLVE YYLAVGPGIV NALFERPCML 
HRFPKGLAGE KVHQKRLPKG APPWVETVRL HFPRWNRTAD ELCVTELGSV IWAVQMSTVE
FHPWNSRRED TERPDEWRID LDPGPECSYD RVRRVAHVAH EVLDELGVVG FPKTSGSKGL
HVYVRIRPDH GFKEVRRAAL AFAREVERRA PEDVDLTWWR KDRDPATVFV DYNQNARDHT
IAAAYSVRGL PDARVSTPIR WDEVDDADPR DFTIFTVPER FARLGDLHTD IDDGGGRGPF
DIASLLEWAD RDERDGAAGP DDEQSE