Gene Noca_3197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagNoca_3197 
Symbol 
ID4599060 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameNocardioides sp. JS614 
KingdomBacteria 
Replicon accessionNC_008699 
Strand
Start bp3395195 
End bp3396964 
Gene Length1770 bp 
Protein Length589 aa 
Translation table11 
GC content71% 
IMG OID639777803 
Productprolyl-tRNA synthetase 
Protein accessionYP_924386 
Protein GI119717421 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0442] Prolyl-tRNA synthetase 
TIGRFAM ID[TIGR00409] prolyl-tRNA synthetase, family II 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0619868 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGATCATGC GGATGTCGAG CCTGTTCGTG CGGACCCTGC GAGAGGACCC CGCGGACGCC 
GAGGTGCCGT CGCACCGGCT GCTCGTCCGC GCCGGCTACA TCCGCCGCGC CGCCCCCGGC
ATCTACACCT GGCTGCCGCT GGGCCTGAGG GTGCTCCGCA AGATCGAGAA CGTCATCCGC
GAGGAGATGG ACGCGATCGG CGCCCAGGAG ATGCTGTTCC CGGCGCTGCT GCCGCGCGAG
CCCTACGAGG CGACCAACCG GTGGACGGAG TACGGCGACG GCATCTTCCG GCTCCAGGAC
CGCAAGGGCG CCGACTACCT CCTCGGCCCG ACCCACGAGG AGATGTTCAC GCTCGTCGTG
AAGGACCTCT ACTCCTCCTA CAAGGACCTG CCGCTCTCGA TCTACCAGAT CCAGACGAAG
TACCGCGACG AGGCGCGACC CCGCGCCGGG CTGCTGCGCG GGCGCGAGTT CGTGATGAAG
GACTCCTACT CGTTCGACGT CGACGACGCC GGCCTCGACG TGAGCTACCA GAAGCACCGC
GACGCCTACG TGCGGATCTT CGACCGCCTC GGGTTCGAGT ACGTCATCGT CGAGGCGATG
TCCGGCGCGA TGGGCGGGTC GAAGTCCGAG GAGTTCCTCG CCAAGGCGAG CGTCGGAGAG
GACACCTACG TGCGCTGCAC GTTGTGCGAC TACGCCGCGA ACGTCGAGGC CGTGCACTCG
CCGCCGATCC CGCCCGTGCC GTACGACGAC GCGCCCGCCG CCCACGCCGA GCAGACTCCC
GACACGCCCA CGATCGAGAC GCTCGTGGCC CACCTCAACG AGCGGTTCCC CCGCGAGGAC
CGCCCCTGGG CGGCGTCGGA CACCCTGAAG AACGTGGTGT TCTCGGTCCA CCACCCCGAC
GGCGCCACCG AGGCGCTGGC CGTCGGCCTG CCCGGCGACC GCGAGGTCGA CGCCAAGCGG
CTCGAGGCCC ACCTCGGCGA GGGCGTCGTC TTCGAGCCGT TCGGCGAGGC GGACTTCGCG
GCCCGCCCGA CCCTGGCCAA GGGCTACATC GGCCCCGGGG CGCTGGGGGA GAAGAAGCCG
GCCGGCGTGC GCTACCTGCT CGACCCACGG GTCGTCGAGG GCACCCGCTG GGTGACCGGA
GCCGACGCCG CCGGCAGCCA TGTCATCGAC CTGGTGGCCG GCCGCGACTT CTCCGGCGAC
GGCCTGATCG AGGTGGCCGA GGTCCGCGAC GGCGACCCCT GCCCGCGCGG CGACGGCGGC
ACCCTCGAGA CCGCCCGCGG CATCGAGATG GGCCACATCT TCCAGCTCGG CCGCAAGTAC
GCCGACGCCC TCGACCTCAA GGTCCTCGAC GAGCAGGGCA AGCTCGTCAC GGTGACCATG
GGCTCCTACG GCATCGGGCC CTCGCGCGCC GTGGCCGCCA TCGCCGAGGG CACCCACGAC
GAGCTGGGCC TGGCCTGGCC GCGGGAGGTC GCGCCCGCCG ACGTCCACAT CGTCGCCACC
GGCAAGGACG AGCACGTCTT CGCGGCGGCC GAGCGGATCG CGCACGAGCT CGACAAGCAG
GGTGTCGAGG TGCTCTACGA CGACCGGCCC AAGGTCAGCC CGGGGGTGAA GTTCAAGGAC
GCCGAGCTGA TCGGTGTCCC GACGATCGTC GTGGTCGGCA AGGGCCTCGC CGCGGGCACG
ATCGAGGTCA AGGACCGACG TACCGGCCAG CGGCAGGACG TGCCGGCCGA CCACCTGGTC
GACCGGGTGA TCGCCATCGT GAGGGCATGA
 
Protein sequence
MIMRMSSLFV RTLREDPADA EVPSHRLLVR AGYIRRAAPG IYTWLPLGLR VLRKIENVIR 
EEMDAIGAQE MLFPALLPRE PYEATNRWTE YGDGIFRLQD RKGADYLLGP THEEMFTLVV
KDLYSSYKDL PLSIYQIQTK YRDEARPRAG LLRGREFVMK DSYSFDVDDA GLDVSYQKHR
DAYVRIFDRL GFEYVIVEAM SGAMGGSKSE EFLAKASVGE DTYVRCTLCD YAANVEAVHS
PPIPPVPYDD APAAHAEQTP DTPTIETLVA HLNERFPRED RPWAASDTLK NVVFSVHHPD
GATEALAVGL PGDREVDAKR LEAHLGEGVV FEPFGEADFA ARPTLAKGYI GPGALGEKKP
AGVRYLLDPR VVEGTRWVTG ADAAGSHVID LVAGRDFSGD GLIEVAEVRD GDPCPRGDGG
TLETARGIEM GHIFQLGRKY ADALDLKVLD EQGKLVTVTM GSYGIGPSRA VAAIAEGTHD
ELGLAWPREV APADVHIVAT GKDEHVFAAA ERIAHELDKQ GVEVLYDDRP KVSPGVKFKD
AELIGVPTIV VVGKGLAAGT IEVKDRRTGQ RQDVPADHLV DRVIAIVRA