Gene SAG2103 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagSAG2103 
SymbolargS 
ID1014914 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameStreptococcus agalactiae 2603V/R 
KingdomBacteria 
Replicon accessionNC_004116 
Strand
Start bp2083466 
End bp2085157 
Gene Length1692 bp 
Protein Length563 aa 
Translation table11 
GC content38% 
IMG OID637317268 
Productarginyl-tRNA synthetase 
Protein accessionNP_689088 
Protein GI22538237 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0018] Arginyl-tRNA synthetase 
TIGRFAM ID[TIGR00456] arginyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00628973 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGGATACAA AACATCTAAT TGCGAGTGAA ATTCAAAAAG TTGTTCCAGA TATGGAACAA 
TCGACCATTC TTTCTTTATT AGAAACCCCA AAAAATTCGA GCATGGGAGA TTTAGCTTTC
CCAGCATTCT CTTTGGCTAA AACTCTACGC AAAGCACCTC AAATAATTGC TAGTGACATT
GCTGAACAAA TTAAAAGCGA CCAATTTGAA AAGGTGGAAG CTGTTGGACC TTACGTTAAC
TTTTTCCTTG ATAAAGCCGC AATCTCAAGT CAAGTTTTAA AACAAGTCTT ATCTGATGGT
TCTGCCTATG CTACTCAAAA TATTGGTGAA GGACGAAATG TTGCCATTGA CATGTCTAGT
CCAAATATTG CTAAGCCATT CTCAATTGGA CACCTTCGTT CAACAGTTAT TGGTGATAGT
TTAGCTAATA TTTTTGATAA AATTGGCTAT CATCCTGTTA AAATTAATCA CCTTGGCGAC
TGGGGTAAAC AATTTGGAAT GTTAATCGTT GCCTATAAAA AATGGGGCAA TGAAGAGGCT
GTCCGTGCTC ATCCCATCGA TGAACTTTTA AAACTTTATG TCCGTATTAA TGCTGAAGCC
GAGACAGACC CTAGCGTTGA TGAAGAGGCA CGTGAGTGGT TTCGTAAACT TGAAGCTAAC
GATCCTGAAG CAACTGAATT ATGGCAATGG TTCCGTGACG AGTCATTATT AGAATTCAAC
CGTCTTTACG ATCAAATGAA TGTTACATTT GATAGCTATA ACGGCGAGGC ATTCTACAAC
GACAAAATGG ATGAAGTGCT TGAACTGCTA GAATCTAAAA ACCTCCTAGT CGAATCTAAA
GGTGCACAAG TTGTTAACCT AGAAAAATAC GGTATTGAAC ATCCTGCCCT CATTAAAAAA
TCCGATGGAG CAACTCTCTA CATTACACGT GACTTAGCTG CTGCACTTTA CCGTAAACGT
ACCTACGATT TTGCAAAATC AATCTACGTT GTAGGAAACG AACAGTCTGC ACACTTTAAG
CAACTAAAAG CTGTGCTTAA AGAAATGGAC TACGATTGGT CCGATGATAT GACTCATGTT
CCTTTTGGCC TTGTAACAAA AGGAGGAGCA AAATTATCAA CTCGTAAAGG AAATGTAATT
CTACTTGAAC CTACTGTTGC AGAAGCTATT AATCGTGCGG CATCTCAGAT TGAAGCTAAA
AACCCTAACC TAGCAGATAA AGATAAAGTT GCTCAAGCAG TTGGTGTAGG TGCTATTAAA
TTTTACGATT TGAAAACTGA TCGTACAAAT GGTTATGACT TTGACTTAGA AGCAATGGTA
TCATTTGAAG GTGAAACTGG GCCATATGTA CAATATGCAC ATGCTCGTAT TCAATCAATC
TTACGTAAAG CTAACTTTAG CCCATCTAAT AGCGATAATT ACAGTCTCAA CGATGTTGAA
AGCTGGGAAA TCATCAAACT CATCCAAGAT TTTCCTCGTA TTATTGTACG TGCGGCAGAT
AATTTTGAGC CCTCTATTAT TGCAAAATTC GCTATTAATT TAGCTCAGTG CTTCAATAAA
TATTACGCCC ACACACGTAT TTTAGATGAA GACGCTGAAA TCAGTAGTCG TCTAGCACTT
TGCTATGCAA CTGCTACAGT ATTAAAAGAA TCTCTTCGCC TCCTAGGAGT TGATGCTCCG
AATGAAATGT AA
 
Protein sequence
MDTKHLIASE IQKVVPDMEQ STILSLLETP KNSSMGDLAF PAFSLAKTLR KAPQIIASDI 
AEQIKSDQFE KVEAVGPYVN FFLDKAAISS QVLKQVLSDG SAYATQNIGE GRNVAIDMSS
PNIAKPFSIG HLRSTVIGDS LANIFDKIGY HPVKINHLGD WGKQFGMLIV AYKKWGNEEA
VRAHPIDELL KLYVRINAEA ETDPSVDEEA REWFRKLEAN DPEATELWQW FRDESLLEFN
RLYDQMNVTF DSYNGEAFYN DKMDEVLELL ESKNLLVESK GAQVVNLEKY GIEHPALIKK
SDGATLYITR DLAAALYRKR TYDFAKSIYV VGNEQSAHFK QLKAVLKEMD YDWSDDMTHV
PFGLVTKGGA KLSTRKGNVI LLEPTVAEAI NRAASQIEAK NPNLADKDKV AQAVGVGAIK
FYDLKTDRTN GYDFDLEAMV SFEGETGPYV QYAHARIQSI LRKANFSPSN SDNYSLNDVE
SWEIIKLIQD FPRIIVRAAD NFEPSIIAKF AINLAQCFNK YYAHTRILDE DAEISSRLAL
CYATATVLKE SLRLLGVDAP NEM