Gene Caul_4389 details

Gene Information

Locus tag: Caul_4389
Symbol: argS
ID: 5901850
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Caulobacter sp. K31
Kingdom: Bacteria
Replicon accession: NC_010338
Strand:
Start bp: 4763514
End bp: 4765355
Gene Length: 1842 bp
Protein Length: 613 aa
Translation table: 11
GC content: 68%
IMG OID: 641564907
Product: arginyl-tRNA synthetase
Protein accession: YP_001686007
Protein GI: 167648344
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0018] Arginyl-tRNA synthetase
TIGRFAM ID: [TIGR00456] arginyl-tRNA synthetase
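
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The variable names are illustrative and not part of the record.

```python
# Coordinates copied from the record above (assumed 1-based, inclusive).
start_bp = 4763514
end_bp = 4765355

# Inclusive coordinates: both endpoints count toward the length.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is assumed
# to be a stop codon, which does not contribute an amino acid.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # 1842 613
```

Both values agree with the Gene Length (1842 bp) and Protein Length (613 aa) fields listed in the record.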


Plasmid Coverage information

Num covering plasmid clones: 34
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value: 0.872152
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGATT TGAAGCGGTC CCTGAGCGAG GCGGCCGCGG CCGCCTTCCA GGCCGCCGGA 
CTGTCCCCCG ACTTCGGTCG CGTCACCGCG TCCGACCGGC CCGACCTGGC CGATTTCCAG
TGCAACGGGG CCCTGGCGGC AGCCAAGAGC GCCAAGCGCA ATCCGCGCGA GATCGCCGTC
CAGGTGGTCG ACGTCCTGAA GGCCGACCCG CGCCTGGCCT CGGTCGAGAT CGCCGGCGTC
GGCTTCATCA ACATGCGCGT CAGCACCGAC GCCCTGTCGA TCCGCGCCAA CGAGATCGCC
GCCGACCCGC GCGCCGGGGC CGAGCCCCTG GCCCATCCGC GCCGCGTGCT GGTCGACTAT
GCCGGCCCCA ACGTCGCCAA GCCGATGCAC GTGGGCCACC TGCGCGCCTC GATCATCGGC
GAGTCGGTCA AGCGCCTGTA CCGCTTCCGC GGCGACGACG TTGTGGGCGA CGCCCATTTC
GGCGACTGGG GCTTCCAGAT GGGCCTGCTG ATCAGCGCCA TCATGGAGGA GGACGCCTTC
ATCCGCGCCC TGCTCGAGCG CCTGGTCGAG GCCCCGCGCG AGTTCTCCAA GGCCGACGAG
GACAAGGTCA TGTCCGAGTT TGCTCAGCGC GTCACCCTGG ACGACCTCGA CCGCCTCTAT
CCGGCAGCCT CGGCGCGCCA GAAGGAAGAC CCCGAGTTCA AGGAAAAGGC CCGCAAGGCC
ACCGCCGAAC TGCAGAACGG CCGCTTCGGC TACCGCCTGC TGTGGCGTCA CTTCGTCAAC
ATCAGCCGCG TGGCCCTGGA GCGCGAGTTC CATGCCCTGG GCGTCGATTT CGACCTCTGG
AAGGGCGAGA GCGACGTGCA GGACCTGATC GCCCCGATGG TCCGCCAGCT CGAGGTCAAG
GGCCTGCTGG TCGACGACCA GGGCGCCCGC ATCGTCCGCG TGGCCCGTCC CGGCGAGACG
AAGAAGAAGA AGCTGCCCGA CGGCTCGGTG GTCGAGGTCG AAAGTCCCGA CCCGCTGCTG
GTGGTGTCGT CGGAAGGTTC GGCCATGTAC GGCACGACCG ACCTGGCGAC GATCCTGGAT
CGCCGCAAGT CGTTCGACCC GCACCTGATC CTCTATTGCG TGGACCAGCG CCAGGCCGAC
CACTTCGAGC AGGTGTTCCG CGCCGCCTAT CTGGCCGGCT ACGCCGAGCC CGGCAGCCTG
GAGCACATCG GCTTCGGCAC CATGAACGGC AGCGACGGCA AGCCGTTCAA GACCCGGGCC
GGCGGCGTCC TGAAGCTGCA CGACCTGATC GAGATGGCCC GCGAGAAGGC CCGCGAGCGC
CTGCGCGAAG CTGGCCTCGG GGCCGAGCTT TCGCAAGAGG CCTTCGAGGA GACCGCCCAC
AAGGTGGGGA TCGCGGCCCT GAAGTTCGCG GACCTGCAGA ACTTCCGTGG CACCTCCTAC
GTCTTCGACC TCGACCGTTT CACAAGCTTC GAGGGCAAGA CCGGGCCGTA CCTGCTGTAC
CAGTCGGTGC GCATCAAGAG CATCCTGCGC AAGGCGGCCG AGCAGAAGGT CGTGTCGGGC
GCGATCATCG TCGGCGAGCC GGCCGAGCGC GACCTGACGC TGCTGTTGGA CGCGTTCGAA
GGCGCCTTGA GCGAGGCCTA CGACAAGAAG GCCCCCAACT TCATCGCCGA GCACGCCTAC
AAGCTGGCGC AGACCTTCTC GAAGTTCTAC GCCGCTTGCC CGATCCTCAG CGCCGACAAT
GACGCCACCC GGGCCTCGCG CCTCGCCCTG GCCGAGACGA CGTTGAAGCA GCTGGAGCTG
GCTTTGGATC TGCTGGGCAT CGAAGCGCCG GAACGGATGT AG
 
Protein sequence
MSDLKRSLSE AAAAAFQAAG LSPDFGRVTA SDRPDLADFQ CNGALAAAKS AKRNPREIAV 
QVVDVLKADP RLASVEIAGV GFINMRVSTD ALSIRANEIA ADPRAGAEPL AHPRRVLVDY
AGPNVAKPMH VGHLRASIIG ESVKRLYRFR GDDVVGDAHF GDWGFQMGLL ISAIMEEDAF
IRALLERLVE APREFSKADE DKVMSEFAQR VTLDDLDRLY PAASARQKED PEFKEKARKA
TAELQNGRFG YRLLWRHFVN ISRVALEREF HALGVDFDLW KGESDVQDLI APMVRQLEVK
GLLVDDQGAR IVRVARPGET KKKKLPDGSV VEVESPDPLL VVSSEGSAMY GTTDLATILD
RRKSFDPHLI LYCVDQRQAD HFEQVFRAAY LAGYAEPGSL EHIGFGTMNG SDGKPFKTRA
GGVLKLHDLI EMAREKARER LREAGLGAEL SQEAFEETAH KVGIAALKFA DLQNFRGTSY
VFDLDRFTSF EGKTGPYLLY QSVRIKSILR KAAEQKVVSG AIIVGEPAER DLTLLLDAFE
GALSEAYDKK APNFIAEHAY KLAQTFSKFY AACPILSADN DATRASRLAL AETTLKQLEL
ALDLLGIEAP ERM
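
The sequences above are printed in blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one possible way to strip the spacing, compute a GC fraction, and translate a nucleotide string. It assumes the amino-acid assignments of translation table 11 match the standard genetic code (the two tables differ in which start codons are allowed, which this sketch does not model). The helper names `clean`, `gc_fraction`, and `translate` are hypothetical, chosen only for illustration.

```python
# Codon table built in the conventional T, C, A, G base order.
# Amino-acid assignments follow the standard code, assumed here to
# apply to translation table 11 as well; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    return sum(base in "GC" for base in seq) / len(seq)


def translate(seq: str) -> str:
    """Translate complete codons; a trailing partial codon is ignored."""
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, len(seq) - 2, 3))


# First two blocks of the gene sequence, copied from the listing above.
prefix = clean("ATGAGTGATT TGAAGCGGTC")
print(translate(prefix))    # MSDLKR
print(gc_fraction(prefix))  # 0.45
```

The translated prefix matches the first six residues of the protein sequence listed above. Applying the same helpers to the full gene sequence would be the natural way to compare against the GC content and protein sequence fields of the record.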