Gene Arth_1407 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagArth_1407 
Symbol 
ID4446078 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameArthrobacter sp. FB24 
KingdomBacteria 
Replicon accessionNC_008541 
Strand
Start bp1566800 
End bp1568611 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content66% 
IMG OID639689218 
Productprolyl-tRNA synthetase 
Protein accessionYP_830901 
Protein GI116669968 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0442] Prolyl-tRNA synthetase 
TIGRFAM ID[TIGR00409] prolyl-tRNA synthetase, family II 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.116406 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
GTGGTCCTTC GACTTTCCAA GCTGTTCCTG CGCACCCTGC GCGAAGATCC CGCCGATGCC 
GAAGTGGCGA GCCACCGGCT CCTGGTCCGC GCCGGGTATA TCCGCAGGGC AGCGCCGGGC
ATCTACACCT GGCTGCCGCT CGGCCTGAGC GTGCTGCGCA AGGTGGAGAA GGTCATTCGC
GAGGAAATGG CCGCCATTGG CGCCCAGGAA GTGCACTTCC CGGCGCTCCT GCCCAAGGAG
CCCTACGAGG CCACTAACCG CTGGACCGAG TACGGCGAGG GCATCTTCCG GCTCAAGGAC
CGCAAGGGCG GGGACTATCT CCTGGCCCCG ACGCACGAGG AAATGTTCAC CCTCCTGGTG
AAGGACCTGT ACTCCTCGTA CAAAGACCTT CCGTTGAGCA TCTACCAGAT CCAGAACAAG
TACCGCGACG AAGCCCGCCC CCGCGCGGGC CTGCTGCGCG GCCGCGAGTT CATCATGAAG
GATTCCTACT CGTTCGACGT CGACGACGCC GGTCTGGACG CGAGTTACAA CGCGCACCGC
GCCGCCTACC TGAAGATCTT CGAACGCCTC GGCCTCGAGG TCATTCCGGT GGCTGCCACG
GCGGGAGCCA TGGGTGGCTC CCGGAGCGAG GAGTTTCTGC ACCCCACCGA GATCGGCGAA
GACACCTTCG TGCGGTCCGC CGGCGGCTAC GCGGCCAACG TTGAAGCCGT CACCACTGTG
GTCCCGGCTG AGATCGACTT CAGCAATGCG CCTGCAGCCG AGATCCGGGA CACCCCGAAC
ACCCCCACCA TCGACACTCT TGTGGACGCG GCAAACCAGC TGGTTCCGCG CGATGAGAAC
GACGGCGGCG CATGGACGGC CGCTGACACG CTCAAGAACG TCGTCCTGGC CGTCACCCTG
CCCACCGGTG AGCGCCAGAT CGTCGTCATT GGCGTACCCG GTGACCGCGG TGTTGACCTG
AAGCGGGTTG AGGCCAACAT CGGCGCTTAC CTGCCGGTCG CCGGCGAGAT CACCGTGGAA
GCCGCGGGCG AGGAAGACCT CGCCCGCAAC CCCCTGATCG TCCGGGGATA CCTCGGCCCG
GGAATGTCCC TCGGCACGCC GCTTCTCGGC CTGGAAGGTG CCGCCAAGCT GCTGTACCTG
GTGGATCCCC GAGTCGTCAA AGGCACCGCA TGGGTGACCG GAGCCAACAT GGCCGGCAAG
CACGTCTTCG GCCTTGTGGC CGGCCGCGAC TTCGGCTGGG ACGGAGTGAT CGAGTGCACG
GAAGTGCGCG CCGGGGATGA AGCCCCGGAC GGTTCCGGCC CGCTGGAAAC CGCACGCGGC
ATTGAGATGG GCCACATCTT CCAGCTTGGC CGCAAGTACG CCGAGGCCCT TGAGTTGAAG
GTCCTGGACC AGAACGGCAA GCAGGTGGTG GTCACCATGG GTTCCTACGG CGTGGGCGTC
ACCCGTGCCG TCGCTGCCTT GGCCGAGTCC AACCACGACG CCAAAGGCCT GGTCTGGCCC
CGTGCAGTGG CTCCTGCCGA TGTCCACGTT GTGGCTGTGG GCCGGGGCGA GGAAATCTTC
GCCGCCGCCG AACAGCTGTC ACTCGAGCTC GAAGCCGCCG GCCTCGAGGT CATCTACGAC
GACCGCCCCA AGGTGTCCCC GGGCGTCAAG TTCGGCGACG CGGAACTCAT TGGCGTGCCC
ACCATCCTGG CCGTTGGCCG CGGGCTGGTG GACGGCGTCG TGGAGATCAA GGACCGCCGC
AGCGGTGAGG CAGAGAACGT GGCAGTTGAG AAGGCTGTTG ACTACGTCGT CAACGCCGTC
CGCAGCAAGT AA
 
Protein sequence
MVLRLSKLFL RTLREDPADA EVASHRLLVR AGYIRRAAPG IYTWLPLGLS VLRKVEKVIR 
EEMAAIGAQE VHFPALLPKE PYEATNRWTE YGEGIFRLKD RKGGDYLLAP THEEMFTLLV
KDLYSSYKDL PLSIYQIQNK YRDEARPRAG LLRGREFIMK DSYSFDVDDA GLDASYNAHR
AAYLKIFERL GLEVIPVAAT AGAMGGSRSE EFLHPTEIGE DTFVRSAGGY AANVEAVTTV
VPAEIDFSNA PAAEIRDTPN TPTIDTLVDA ANQLVPRDEN DGGAWTAADT LKNVVLAVTL
PTGERQIVVI GVPGDRGVDL KRVEANIGAY LPVAGEITVE AAGEEDLARN PLIVRGYLGP
GMSLGTPLLG LEGAAKLLYL VDPRVVKGTA WVTGANMAGK HVFGLVAGRD FGWDGVIECT
EVRAGDEAPD GSGPLETARG IEMGHIFQLG RKYAEALELK VLDQNGKQVV VTMGSYGVGV
TRAVAALAES NHDAKGLVWP RAVAPADVHV VAVGRGEEIF AAAEQLSLEL EAAGLEVIYD
DRPKVSPGVK FGDAELIGVP TILAVGRGLV DGVVEIKDRR SGEAENVAVE KAVDYVVNAV
RSK