Gene Franean1_1004 details


Gene Information

Locus tag: Franean1_1004
Symbol: argS
ID: 5669418
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1183802
End bp: 1185514
Gene Length: 1713 bp
Protein Length: 570 aa
Translation table: 11
GC content: 72%
IMG OID: 641239933
Product: arginyl-tRNA synthetase
Protein accession: YP_001505366
Protein GI: 158312858
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0018] Arginyl-tRNA synthetase
TIGRFAM ID: [TIGR00456] arginyl-tRNA synthetase
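
The start, end, gene length, and protein length fields can be cross-checked with simple arithmetic. The sketch below assumes the coordinates are 1-based and inclusive and that the CDS ends in a single stop codon. The function names are illustrative and do not come from any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_length_bp: int) -> int:
    """Residue count for a CDS, assuming one trailing stop codon is not translated."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


length_bp = gene_length(1183802, 1185514)
print(length_bp)                  # 1713
print(protein_length(length_bp))  # 570
```

Both values agree with the Gene Length and Protein Length fields of this record.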


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCCCCG CCGAGCTCGC CGACACCATC GTCGCGGCCG TTCGGGCTGC CGTCGCGAAC 
GGCGACCTGG AAGTGGCCGT GCCCGACTCG GTCACCGTCG AGCGACCGAG GCAGCCCGAG
CACGGAGACT ACGCGTCTCC GGTCGCCCTG CAGCTCGCGA AGGCGGCGCG CCGCCGGCCC
CGGGAGGTGG CCGAGCTGCT CGCCGCCCGC CTGCGGGCCG AGGCGGGTGT GGCGGAGGTG
GAGGTCGCCG GCCCGGGCTT CCTGAACATC CGGCTGGCCG GTGCGGCCCT GGGCGGCATC
GCCCGCCGGA TCGTCCGGGA CGGCGAGTCC TACGGCCGCG CCGCGGTGTC CCAGGGCGTC
CGGGTGAACC TCGAGTTCGT CAGCGCGAAC CCGACCGGCC CGGTGACACT GGCGTCCGCG
CGCTGGGCGG CGGTGGGCGA CGCGCTTTCC CGGGTGTTCG CCGCGGCGGG CTACGAGGTC
GGCACCGAGT ACTACGTCAA TGACGCCGGT GTGCAGGTCG AGCGGTTCGG CGCGTCGGTG
CTGGCCGCGC TGCGCGGCCA GCCGGCTCCC GCCGACGGCT ACCAGGGCGC CTACGTCGCC
GAGATCGCCG CGAAGGTCCT GGCGGCGAAC CCGGCTCTGG AGCAGCTCCT GGCCGCCTCC
GGCGGCGAGC AGGACAGCGG CGCCGAGCAG GACAAGGCGC TCGCGGTCTG CGCCCGCGAC
GGCGTGGAGC TGATGCTTGC CGAGATCCGC GCGACGCTTT CGGGGTTCGG GGTCGAGTAC
GACCTGTGGA AGTCCGAGCG CAGCCTGCAC GAGGCGGGGG AGCTCATCGC CGCGATCGAC
GAGCTGCGCA CCCAGGGCCA CGTCTACGAG GCTGGCGGTG CGGTCTGGCT GCGCACCACC
GACTTCGGCG ACGACAAGGA CCGCGCTCTG ATCAAGAGTG ACGGTCGGCC CACCTACTTC
TGTGCCGACG CGGCCTACTA CCGCGACAAG CGGCGCCGTG GCTTCGACCG GCTCTGCTAC
CTGCTCGGAG CCGACCACCA CGGCTACATC GGCCGGCTGA AGGCGATCTC GGCCTGTTTC
GGCGACGATC CCGACCACAA CCTCGACGTG CTCATCGGCC AGATGGTGAC GCTGTCGCGG
GGCGGCGTGG CGGTGAAGAT GTCCAAGCGG GCAGGTAACT TCCTGACCCT GCACGACCTT
GTCGACGCGG TGGGGGTCGA CGCGGCCCGC TACTCGCTGG TGCGCGCGTC GATGGACTCC
GCCCTCGACC TCGACCTGGA CGCGATCGCG CGGCAGACGA ACGACAACCC GGTGTTCTAT
GTCCAGTACG CCCACGCCCG GATCAGTTCG CTCATCCGGA ACGCCGCCGC CCTGGGTCTG
GCGACCTCCG CGGATCCCGC GTTCGACGTC GACGGGGTCG ACGTGTCGCT GCTGACCCAC
CCACGCGAGG TCGACCTGCT GGGCGCGCTC GGCGAGCTGC CGCGGGTGGT CGAGTCGGCG
GCGGAGCTGC GCGCGCCGCA CCGTATCGCG AGGTACCTGG AGGAGTTGGC CGGGACGTAC
CACCGCTTCT ACGACTCCTG TCGCGTGCTT CCGCAGGGCG ACGAGGAGCC CACGGCGATC
ACCGCGGCTC GACTGCTGCT GGCCGAGGCC ACCCGGACCG TCCTGGCGAA CGGCCTTCGG
CTGCTGGGCG TCAGCGCGCC GGAACGGATG TGA
 
Protein sequence
MTPAELADTI VAAVRAAVAN GDLEVAVPDS VTVERPRQPE HGDYASPVAL QLAKAARRRP 
REVAELLAAR LRAEAGVAEV EVAGPGFLNI RLAGAALGGI ARRIVRDGES YGRAAVSQGV
RVNLEFVSAN PTGPVTLASA RWAAVGDALS RVFAAAGYEV GTEYYVNDAG VQVERFGASV
LAALRGQPAP ADGYQGAYVA EIAAKVLAAN PALEQLLAAS GGEQDSGAEQ DKALAVCARD
GVELMLAEIR ATLSGFGVEY DLWKSERSLH EAGELIAAID ELRTQGHVYE AGGAVWLRTT
DFGDDKDRAL IKSDGRPTYF CADAAYYRDK RRRGFDRLCY LLGADHHGYI GRLKAISACF
GDDPDHNLDV LIGQMVTLSR GGVAVKMSKR AGNFLTLHDL VDAVGVDAAR YSLVRASMDS
ALDLDLDAIA RQTNDNPVFY VQYAHARISS LIRNAAALGL ATSADPAFDV DGVDVSLLTH
PREVDLLGAL GELPRVVESA AELRAPHRIA RYLEELAGTY HRFYDSCRVL PQGDEEPTAI
TAARLLLAEA TRTVLANGLR LLGVSAPERM
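
The gene and protein sequences above are printed in blocks of ten characters. A minimal sketch of how such a record could be processed follows: it strips the block spacing, computes the GC fraction, and translates the DNA. It assumes the sense-strand sequence is given 5' to 3' and relies on translation table 11 sharing its codon-to-amino-acid assignments with the standard code. Table 11's alternative start codons are not handled, which does not matter here because the gene begins with ATG. All helper names are hypothetical.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean_sequence(blocked: str) -> str:
    """Remove spaces and line breaks from a blocked sequence and upper-case it."""
    return "".join(blocked.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    seq = clean_sequence(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    seq = clean_sequence(seq)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence listed above.
print(translate("ATGACCCCCG CCGAGCTCGC CGACACCATC"))  # MTPAELADTI
```

Passing the full gene sequence to `translate` and `gc_fraction` would allow a comparison against the listed protein sequence and the GC content field.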