Gene Francci3_3901 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3901 
SymbolleuS 
ID3906669 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4667018 
End bp4670212 
Gene Length3195 bp 
Protein Length1064 aa 
Translation table11 
GC content71% 
IMG OID637881227 
Productleucyl-tRNA synthetase 
Protein accessionYP_482980 
Protein GI86742580 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0495] Leucyl-tRNA synthetase 
TIGRFAM ID[TIGR00396] leucyl-tRNA synthetase, eubacterial and mitochondrial family 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACGAG CGATGAGTGA GACGGCCGAG CCCGGCGCCC GGACCGGCGC CGCGGACACC 
ACGGTGGCAC CGACCGGTGC GTCTGGCGGG ATCATCCCGG CAGCCGCGGG CACCGCGGGC
GGCGCCCCCG CCGGGACCGG GAGCGTCGAG CCGAGCTTCC GGTACGACGC CCGGCTCGCC
GCCGACATCG AGCGGCGCTG GCAGCGCCGG TGGGCCGACG AGGGCACGTT CAACTCGCCG
AACCCGGTCG GGCCGCTCTC GACGGGCTTC GAGAAGGTCG CCGGCCGGGA GCCGTTCTAC
ATCATGGACA TGTTCCCCTA TCCGAGCGGC TCGGGGCTGC ACGTCGGGCA CCCGCTGGGG
TACATCGGCA CCGACGTGTT CGCCCGCTAT CTGCGGATGT CCGGCCGGCA CGTGCTGCAT
CCGTTCGGAT ACGACTCCTT CGGCCTGCCC GCCGAGCAGT ACGCCATCAA CACGGGCCAG
CATCCCCGCG ACACCACCAA CGCCAACATC GCCAACATGC GCCGCCAACT CTCCCGGCTG
GGGCTGGGCC ACGACACCCG CCGCGAGATC GCGACGACCG ACGTGGGCTA CTACCGCTGG
ACGCAGTGGA TCTTCCAGCA GATCTTCAAC AGCTGGTACG ACCCGCAGGC CGGCCGGGCC
CGGCCGATCG CCGAGCTGAT CGAGGAGTTC GCCGCGGGCA CCCGGGCGCC GGTAGCCGGC
CCCGCCGGGG GGAACACGGC CGTGTCGGTT GACGCGGTCC GGGCGGCGAA CCCGGCCGGG
CTGGCGTGGA CCGAACTCGA CGAGGTGTCC CGCCGGAAGG TCGTCAACGC GCACCGCCTG
GCCTACATCT CCGAACAGCT GGTCAACTGG TGCCCGGGGC TGGGCACCGT GCTGGCGAAC
GAGGAGGTCA CCGCCGACGG CCGCAGCGAC ATCGGGAACT ACCCAGTGTT TCGCCGGCCG
CTGAAGCAGT GGATCCTGCG GATCACCGCC TATGCCGAGC GGCTGATCTC CGACCTCGAC
CTGGTCGACT GGCCGGACTC GATCAAGCAG ATGCAGCGCA ACTGGATCAG TCCGAGCGAG
GGCGCCAGCG TCGAGTTCAC CGTCGTCGCC CCCGGCGAGG AGGCAGGTGC GTCCGATCCG
TCTGGCTCAT CGACCGCCCG GCGTATCGAG GTCTATACCA CCCGCCCGGA CACCCTGGCG
GGGGCCACCT TCCTGGTACT CGCGCCCGAA CATCCCCTGG CCGACGCCCT GATCGCCGAC
ACCTGGCCGG CGGACACCCC GGTGAGCTGG CGCTTCCCGG CGGGACGGCC GGGCGGCGGC
ACGGAACCGG CGGACACCGC CGGGCCGGAG GCGGGCGCCG ATCCGGCGTG GACCCCGCGA
GCCGCCGTCG ACGCCTACCG GGAGTTCGCG GCCCACCGCA GCGACCGGCA GCGCGGCGAG
GAGGTCATCG ACCGCACCGG CGTGTTCACC GGCTCGTACG TGCGCAACCC GGTCGGTGGC
GGGGTCATCC CGGTCTTCCT GGCCGACTAT GTGCTGCTGG GCTACGGCAC CGGGGCAATC
ATGGCGGTAC CGGCGCACGA CAGCCGGGAC TTCTCCTTCG CCCGCGCGTT CGACCTGCCG
ATCCCCGCCG TGCTGGAGCC GGACGCGGAC TGGTACGCCG CGCACGGGGT AGTGCCCGCG
ACTCCATCAG CGCAGTGGCC CGAGGCGTTC AGCGGTGCGG GCGAGTATCG GCCCGGTCCG
GCCAGCGCCC CGGTGCTGGT CGGCCTGTCG AAAAGCGAGG CGATCAAGGC CACGGTTCAC
TGGCTGGAGG AGATCGGCGC CGGCAGGTCG GCGCGGTCGT ACCGGCTGCG GGACTGGCTG
TTCTCCCGCC AGCGGTACTG GGGCGAGCCG TTCCCGATCG TCTTCGATGT CGACGGGCTG
CCCCACGCGG TTCCCGACGA GCTGCTGCCG ATCGAACTGC CGGAGATGAC CGACTTCCGG
CCCACGGCGA TGGCCGAGGA CGACGCGAGC GACCCGGTGC CCCCCCTGGC CCGGGTGGCC
GACTGGGTGA CGGTCACCCT GGATCTCGGC GACGGGCCGA AGCAGTACCG GCGCGAGACG
AACACCATGC CGCAGTGGGC CGGTTCGTGT TGGTACTACC TGCGCTACCT GGACCCGACC
AACACCGAGC GCTTCGTCGA CCCGACCGTC GAGCGCTACT GGATGGCCAG GCCGGGCGCG
GTTCCCGGCG ACGGCGGCGT CGATCTGTAC GTCGGCGGTG TCGAGCACGC CGTGCTGCAC
CTGCTCTACG CCCGGTTCTG GCACAAGGTG CTCTACGACC TGGGCCACGT CTCCACCAAG
GAGCCGTTCA AGCGGCTGTT CAACCAGGGA TACATCCAGG CGGATGCCTT CACCGACGCC
CGGGGCATGT ACGTCCCGGC GGCCGAGGTG ACGGCGACCC CCGACGGCCG GTTCCTCTTC
CAGGGCGCCC CGGTCAACCG GCGCTCGGGC AAGATGGGCA AGAGCCTGAA GAACAGCGTC
AGCCCGGACG AGATGTACGA CAGGTTCGGC GCCGACACGC TGCGCGTCTA CGAGATGGCG
ATGGGCCCGC TCGACGCTGA CCGGCCATGG CACACCGACG ACATCGTCGG TTCGCACCGG
TTCCTCCAGC GGCTGTGGCG CACCGTCGTC GACGAAACCA CCGGGGCGGC CGCCGTCGTT
GACGAGCCGT TGGACGACGA GGCTCTTCGC GTCCTGCACC GGACGATCCT CACGGTCACC
GCCGAATACG CGGGGCTGCG GTTCAACACC GCGGTCGCCC GGCTCATCGA ACTAACCAAC
TTCGTCAGCA AGAGCTACGG GAAATCCCCC ACCCCCCGCG CGCTCGCCGA GCCGCTCACC
CTGATGGCGG CCCCGCTGGC CCCGCACATC GCCGAGGAAC TGTGGTCCCG CCTCGGTCAC
GAGGAGTCGG TCAGCACGGT CGCCTTCCCG ATCGGGGATC CGGCGCTGGC CGCCGAGTCG
GTCAGGACGA TCCCGGTCCA GGTGAACGGG AAGGTCCGGT TCACCATCGA GGTTCCGGAC
GGTTCAGCGG AGCAGACGGT TCGCGATCTG CTCGCCGCAC ATCCCGAGTT CGCCCGGCAG
ACCGATGGTC GGACGATCAA GAAGATCATC GTCGTGCCCG GTCGGATCGT GAATATCGCC
ATCTCCCCCG CCTAG
 
Protein sequence
MARAMSETAE PGARTGAADT TVAPTGASGG IIPAAAGTAG GAPAGTGSVE PSFRYDARLA 
ADIERRWQRR WADEGTFNSP NPVGPLSTGF EKVAGREPFY IMDMFPYPSG SGLHVGHPLG
YIGTDVFARY LRMSGRHVLH PFGYDSFGLP AEQYAINTGQ HPRDTTNANI ANMRRQLSRL
GLGHDTRREI ATTDVGYYRW TQWIFQQIFN SWYDPQAGRA RPIAELIEEF AAGTRAPVAG
PAGGNTAVSV DAVRAANPAG LAWTELDEVS RRKVVNAHRL AYISEQLVNW CPGLGTVLAN
EEVTADGRSD IGNYPVFRRP LKQWILRITA YAERLISDLD LVDWPDSIKQ MQRNWISPSE
GASVEFTVVA PGEEAGASDP SGSSTARRIE VYTTRPDTLA GATFLVLAPE HPLADALIAD
TWPADTPVSW RFPAGRPGGG TEPADTAGPE AGADPAWTPR AAVDAYREFA AHRSDRQRGE
EVIDRTGVFT GSYVRNPVGG GVIPVFLADY VLLGYGTGAI MAVPAHDSRD FSFARAFDLP
IPAVLEPDAD WYAAHGVVPA TPSAQWPEAF SGAGEYRPGP ASAPVLVGLS KSEAIKATVH
WLEEIGAGRS ARSYRLRDWL FSRQRYWGEP FPIVFDVDGL PHAVPDELLP IELPEMTDFR
PTAMAEDDAS DPVPPLARVA DWVTVTLDLG DGPKQYRRET NTMPQWAGSC WYYLRYLDPT
NTERFVDPTV ERYWMARPGA VPGDGGVDLY VGGVEHAVLH LLYARFWHKV LYDLGHVSTK
EPFKRLFNQG YIQADAFTDA RGMYVPAAEV TATPDGRFLF QGAPVNRRSG KMGKSLKNSV
SPDEMYDRFG ADTLRVYEMA MGPLDADRPW HTDDIVGSHR FLQRLWRTVV DETTGAAAVV
DEPLDDEALR VLHRTILTVT AEYAGLRFNT AVARLIELTN FVSKSYGKSP TPRALAEPLT
LMAAPLAPHI AEELWSRLGH EESVSTVAFP IGDPALAAES VRTIPVQVNG KVRFTIEVPD
GSAEQTVRDL LAAHPEFARQ TDGRTIKKII VVPGRIVNIA ISPA