Gene Francci3_1275 details

Gene Information

Locus tag: Francci3_1275
Symbol:
ID: 3905080
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 1520749
End bp: 1523802
Gene Length: 3054 bp
Protein Length: 1017 aa
Translation table: 11
GC content: 73%
IMG OID: 637878609
Product: glycine--tRNA ligase
Protein accession: YP_480382
Protein GI: 86739982
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0751] Glycyl-tRNA synthetase, beta subunit
        [COG0752] Glycyl-tRNA synthetase, alpha subunit
TIGRFAM ID: [TIGR00211] glycyl-tRNA synthetase, tetrameric type, beta subunit
            [TIGR00388] glycyl-tRNA synthetase, tetrameric type, alpha subunit
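
The coordinate and length fields above are consistent with each other. The short Python check below is an illustrative sketch, not part of the database record. It assumes the Start bp and End bp values are 1-based and inclusive, and that the final codon of the CDS is a stop codon, which is not translated. The variable names are chosen here for illustration only.

```python
# Values copied from the Gene Information fields above.
# Assumption: coordinates are 1-based and inclusive.
start_bp = 1520749
end_bp = 1523802

# Inclusive coordinates, so the span is end - start + 1.
gene_length = end_bp - start_bp + 1

# A CDS is read in codons of three bases; the last codon is a stop codon
# and contributes no amino acid to the protein.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, protein_length)  # prints "3054 1017"
```

Both results match the Gene Length (3054 bp) and Protein Length (1017 aa) fields.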


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.208463
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGTTGACCA TGCAGGACGC GCTGCTGGCG CTGACCCGTT ACTGGACCGA CCGCGGCTGC 
ATGATCGTAC AGCCGTTCAA CACCGAGGTC GGCGCCGGTA CGCTCAACCC AGCCACGGTG
CTGCGGGTGC TCGGACCGGA GCCGTGGCGG GTCGCGTACG TCGAACCTTC GGTCCGTCCC
GACGACTCCC GCTACGGCCA CAACCCGAAC CGGCTGCAGA CCCACACCCA GTTCCAGGTC
ATCCTCAAAC CGGATCCGGG TAACCCCCAG GAGCTCTACC TGGGTAGCCT CGCGGCGCTC
GGCATCGACA CCGCCGCCCA CGATGTCCGG TTCGTCGAGG ACAACTGGGC CTCGCCCGCG
TTCGGTGCCT GGGGGCTGGG CTGGGAGGTC TGGCTCGACG GCCTGGAGAT CACCCAGTTC
ACCTACTTCC AGCAGGCCGG TGGCCAGACG CTCGACACCG TCAGCGTCGA GATCACCTAT
GGCGTGGAAC GCATCCTCAT GGCCCTGCAG GGCGTCCGCC ACTTCAGCGA GATCGCTTAC
GCCCCCGGCA TCTCCTACGG CGAGGTGTTC GGCCAGGCCG AGTACGAGAT GTCCCGGTAC
TACCTCGACG ACGCCGACAT CGACACCGTG CGCCGGCTCT ACGCCGACTA CGCCGCCGAG
GCCCGCCGGC TCATCGACGC CCGGCTGCCC GTCCCCGCGC ACTCCTACGT GCTGAAGTGC
TCGCACGCCT TCAACATCCT CGACTCCCGG GGCGCGGTCT CTACCACCGA GCGGGCCACC
TCCTTCGCGC AGATGCGTGG ACTGTCGCGG GAGGTCGCGA CGCTGTGGCG GGACCGTCGC
GAGGAGCTCG GCCATCCCCT CGGGGCGGCT TCCCCGCCAC CGGCCGCGGT GCCCGCCGCG
GTGACCACCC GGGCCGAGGA GCCGGCCACC CTGCTGTTCG AGATCGGGAC GGAGGAGCTC
CCGGCGGCCG AGGTGACCCG GACCGTCGAG GCGGTGCGCG CCGGCCTCGT TGAGCGGCTG
GCGGCCACCC GCCTCACGCA CGGTGCGGTC CGGGTGCTGG GCACTCCCCG GCGCATCGTG
GCGATCGTTG ACGAGGTGGC GCCCCGGGAG CCGGACGTCG AGCGGGTCGT CCGCGGCCCG
CGCGTCTCGG CCGCCTACGA CGCCGCCGGC GCGCCGACGA AGGCCGCCGT CGGGTTCGCG
CGTGGTCAGG GTGCGGATCC CGCCGGCCTC CAGGTCGTCA CCCACCGCGG TGTCGAGCAT
GTCGCGCTCG TGCGCACCGA GGCCGGCCGG GACGCCGCCC GGGTGCTGGC CGGGGTGCTC
GGTGAGCTGG TGGTGGGGCT GCGCGCGGAG CGCAACATGC GCTGGAACGA TCCGACCTTG
TCGTTCAGCC GGCCGGTGCG CTGGCTGCTC GCCCTGCTCG GCGAGACCGT GGTGCCGGTG
ACGGTGTCGA CGCTCGCGGC CGGTCGGACC ACCCGCGGCC ACCGGCGGGC CGGCTCCCCG
CCGCTGGACG TGCCGAGCGC GTCCGGCTAC CCAGAACTGC TCGCTGCCCG GTCGATCCTG
CTCGACCCGG TGGTTCGCCG CGAGCTCGTC GTCGAGGAGG CGGCCAAGCT CACCGCGGAC
TCCGGCGGTC ACATCGACCT CGTCGCCGAG GCGGACGTGC TCACCGAGGT GGCGAACCTC
GTCGAGTTCC CGTGCCCGAT CCTCGGGTCC TTCGAGGAAC GCTTTCTGGA CCTGCCGGCG
GAGATCCTCA CCACCGTCAT GCGCAAGCAC CAGCGCTACC TGCCGGTGCG TGACGCCGCC
GGGCGGCTGC TGCCGTCGTT CGTAGCCGTC GCCGATGGTA AGGTCGACCA AGCGCTGGTC
CGCGCCGGCA ACGAGGACGT CATCCGGGCC CGGTTCACCG ACGCCGCGTT CTTCTACGAC
GCCGACATCA CGGTGCCACT CGAGACGTTC CGGGCGGAGC TCGCGAAGCT GACGTTCGAG
CAGCGGCTCG GCTCCGTCGC GGACCGGGCC GACCGCATCG CGGCGCTGGC GCGTGACCTC
GCCGAGGAGA TCGACCTCGG TGAGGACGAC CGGGCCACCC TGGACCGGGC GGCCGCGCTC
GCCAAGTTCG ATCTGGCCAC CCAGATGGTC ATCGAGCTGT CGAGCCTCGC CGGCACGATG
GCGCGGGAGT ATGCCCGCCG GGCCGGTGAG CCCGAGGCGG TCGCCGTCGC CCTCTTTGAG
ATGGAGCTGC CCCGCCGGGC CGGTGACCAG CTGCCCGATA CCGCCCCCGG TGGCCTGCTG
GCCCTCGCCG ACCGGCTCGA TCTGCTTGTC GGGCTGTTCG CGCTCGACGC GGCCCCCACG
GGGAGTTCGG ATCCGTTCGG GTTGCGCCGG GCCGCGCTCG GGGTCGCCGC GATCCTGGGC
AGCCGGCCTG AGCTCGCCGG CCTCACCGTC ACCGGCGCGC TGGCCCGCGC CGCCCGGCTC
ACCCCGGTGC CGGTGTCCGC CGAGGCGCTC GAGGCCGCCG AGACGTTTGT CCGTGGCCGC
TACGTCCAGC AACTACTCGA TACCGGAGTG GACCACCGCC TGGTCGGCGC GGTCGGGCCG
CTCACCGGCA CGCCGGCCCG CGCGGCCGCC ACCTTGGTGA CACTGCGGCA GCTCGTCCAG
GGGGAGTCCG CACCCGCCGC CGGCTTCGCC GCCCTGGCCG CCGCGTTGCA GCGGGTCCGC
CGGATCGTGC CGGCCGGGAC CCCGGCGGTG CTCGACGCGC AGCGCCTCAC CGAACCCGCC
GAGATCGACC TGCTGGAGGT CGTGACGGCG CTCGTCCGGC GGCTCGCGTC CTCCGGCGCC
GATTCGGCTG TCACCGCGGA GACGGCGGCC GGAGCGATCT CCCTTGATGA GCTGGTCGTC
GCAGCCGGAG ACCTGCCGGC CGCCGTCGAC GCGTTCTTCG ACGCGGTGAT GGTGATGGCC
GACGACCCGG CGGTGCGGGC GGCGCGGCTC GGTCTGCTGG CCTGGATCCG GGACCTGACG
ACCGGAGCTC TCGACTGGGA GGCTCTCGGC TCGCTGTCCG CCTCCGGTCG CTGA
 
Protein sequence
MLTMQDALLA LTRYWTDRGC MIVQPFNTEV GAGTLNPATV LRVLGPEPWR VAYVEPSVRP 
DDSRYGHNPN RLQTHTQFQV ILKPDPGNPQ ELYLGSLAAL GIDTAAHDVR FVEDNWASPA
FGAWGLGWEV WLDGLEITQF TYFQQAGGQT LDTVSVEITY GVERILMALQ GVRHFSEIAY
APGISYGEVF GQAEYEMSRY YLDDADIDTV RRLYADYAAE ARRLIDARLP VPAHSYVLKC
SHAFNILDSR GAVSTTERAT SFAQMRGLSR EVATLWRDRR EELGHPLGAA SPPPAAVPAA
VTTRAEEPAT LLFEIGTEEL PAAEVTRTVE AVRAGLVERL AATRLTHGAV RVLGTPRRIV
AIVDEVAPRE PDVERVVRGP RVSAAYDAAG APTKAAVGFA RGQGADPAGL QVVTHRGVEH
VALVRTEAGR DAARVLAGVL GELVVGLRAE RNMRWNDPTL SFSRPVRWLL ALLGETVVPV
TVSTLAAGRT TRGHRRAGSP PLDVPSASGY PELLAARSIL LDPVVRRELV VEEAAKLTAD
SGGHIDLVAE ADVLTEVANL VEFPCPILGS FEERFLDLPA EILTTVMRKH QRYLPVRDAA
GRLLPSFVAV ADGKVDQALV RAGNEDVIRA RFTDAAFFYD ADITVPLETF RAELAKLTFE
QRLGSVADRA DRIAALARDL AEEIDLGEDD RATLDRAAAL AKFDLATQMV IELSSLAGTM
AREYARRAGE PEAVAVALFE MELPRRAGDQ LPDTAPGGLL ALADRLDLLV GLFALDAAPT
GSSDPFGLRR AALGVAAILG SRPELAGLTV TGALARAARL TPVPVSAEAL EAAETFVRGR
YVQQLLDTGV DHRLVGAVGP LTGTPARAAA TLVTLRQLVQ GESAPAAGFA ALAAALQRVR
RIVPAGTPAV LDAQRLTEPA EIDLLEVVTA LVRRLASSGA DSAVTAETAA GAISLDELVV
AAGDLPAAVD AFFDAVMVMA DDPAVRAARL GLLAWIRDLT TGALDWEALG SLSASGR
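
Both sequences above are displayed in blocks of 10 characters, 6 blocks per line. Before any analysis, the spaces and line breaks have to be removed. The Python sketch below shows one way to do that and to compute a GC fraction like the one reported in the GC content field. The function names `clean_sequence` and `gc_fraction` are hypothetical helpers written for this illustration, and the demo input is only the first display line of the gene sequence, so its result is not the whole-gene value.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for the 10-character display blocks."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First display line of the gene sequence, used only as a short demo input.
first_line = "ATGTTGACCA TGCAGGACGC GCTGCTGGCG CTGACCCGTT ACTGGACCGA CCGCGGCTGC"
seq = clean_sequence(first_line)

print(len(seq))                    # number of bases after removing the spacing
print(round(gc_fraction(seq), 2))  # GC fraction of this one line only
```

To check the whole gene, paste the full Gene sequence block into `clean_sequence`. The cleaned length should equal the Gene Length field.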