Gene Franean1_4892 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4892 
Symbol 
ID5673232 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5870770 
End bp5872104 
Gene Length1335 bp 
Protein Length444 aa 
Translation table11 
GC content74% 
IMG OID641243747 
Productcysteinyl-tRNA synthetase 
Protein accessionYP_001509163 
Protein GI158316655 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0215] Cysteinyl-tRNA synthetase 
TIGRFAM ID[TIGR00435] cysteinyl-tRNA synthetase
[TIGR03447] cysteine--1-D-myo-inosityl 2-amino-2-deoxy-alpha-D-glucopyranoside ligase 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00699487 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0480407 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCTGCT CGTCGCCGGC CTGCCCGGCT TGTCCCGCGG CCGGCCCAGC CCCGGTTACC 
CTGACACGCA TGCAGGCGTG GCCCTTTCCC CAGCTCCCCA AGCTTCCCGG ACAGGGTCGC
GACCTCCGCG TTCTCGACAC GGCGCACGGC GGAGTGCGGA CTCTGGACCT CGGGCCGACG
GTCCGGCTCT ACGCCTGCGG GATCACCCCG TATGACGCCA CCCACCTCGG GCACGCCTTC
ACGTACCTCA CCTACGACCT CGTCCAGCGC GTGCTGCGGG ACGCCGGCCA CGAGGTGCGG
TACGTGCAGA ACGTCACCGA CGTCGACGAT CCGCTGCTCG AGCGGGCCAC CCGCGACGGC
ATCGACTGGC GCGACCTCGC CCGCCGCGAG ATCGACCTGT TCCGGGCCGA CATGACGGCG
CTGCGCATCC TGCCCCCCGA CCACTACGTC GGCGTCGTCG AGGCCGTCGG GCTCATCGTC
GACATGGTCT CCCAGCTCGT CGAGCGCGGC GCCGCGTACT CCGTCGACGG CGATCTCTAC
TTCTCCGTCG CCGCGGCGCC GGACTTCGGC CAGGTGGCCC ACCTCGACCC CGCGCAGATG
CTGGTGTCCT GCGCCGAGCA CGGCGGTGAT CCCGGCCGGC CGGGCAAGAA GGACCCGCTC
GACCCGCTGC TGTGGCGCGC GGAGCGGCCG GGGGAGCCGT CCTGGCCGTC GCCGTTCGGC
CCGGGGCGGC CGGGCTGGCA CGTGGAGTGC TCCGCGATCG CCCGGCACTA CCTCGGCGCC
ACGATCGACA TCCAGGGCGG CGGGAGCGAT CTCGCCTTCC CACACCACGA GTGCAGCGCC
GCGCACGCCG AGGTCGCGAA CGGCGCCCGG CCGTTCGCCC GCGCCTACGT CCACACCGCG
CTGGTCAGCC TGGACGGCCA CAAGATGTCG AAGTCGCGGG GGAACCTCGA GTTCGTCTCG
AGGCTGCTCG CGCGCGGAGC GGACCCGGCC GCGATCAGGC TCGCCCTGTT GCAGCATCAT
CACACCGTGG AGTGGGAGTG GACCGCGGCC GCCATGCCGG CGGCAGCCGA GCGGCTCGAC
CGGTGGCGCG CCGCCGTCGC GCTGCCGTCC GGCCCAGACT TCCGGCCGGT GCTGGCCGAG
GTCCGGGACC GGCTCGCCGA CGACCTCGAC GCGCCTGGAG CGCTCGCCGC GGTCGACGCG
TGGGCGGCGG CCGCACTGGC CGCTGGCAGC GGCGGCAGCG GCGAGGCGGA CGACCAGGCG
CCGGCCACCG TCCGCGACAC CGTCGACTCA CTGCTCGGTG TCGACCTTGG GCCTGTCCTA
CCGAGAGGAA CCTAG
 
Protein sequence
MPCSSPACPA CPAAGPAPVT LTRMQAWPFP QLPKLPGQGR DLRVLDTAHG GVRTLDLGPT 
VRLYACGITP YDATHLGHAF TYLTYDLVQR VLRDAGHEVR YVQNVTDVDD PLLERATRDG
IDWRDLARRE IDLFRADMTA LRILPPDHYV GVVEAVGLIV DMVSQLVERG AAYSVDGDLY
FSVAAAPDFG QVAHLDPAQM LVSCAEHGGD PGRPGKKDPL DPLLWRAERP GEPSWPSPFG
PGRPGWHVEC SAIARHYLGA TIDIQGGGSD LAFPHHECSA AHAEVANGAR PFARAYVHTA
LVSLDGHKMS KSRGNLEFVS RLLARGADPA AIRLALLQHH HTVEWEWTAA AMPAAAERLD
RWRAAVALPS GPDFRPVLAE VRDRLADDLD APGALAAVDA WAAAALAAGS GGSGEADDQA
PATVRDTVDS LLGVDLGPVL PRGT