Gene Francci3_4251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_4251 
SymbolcysS 
ID3907218 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp5073632 
End bp5075206 
Gene Length1575 bp 
Protein Length524 aa 
Translation table11 
GC content70% 
IMG OID637881577 
Productcysteinyl-tRNA synthetase 
Protein accessionYP_483326 
Protein GI86742926 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0215] Cysteinyl-tRNA synthetase 
TIGRFAM ID[TIGR00435] cysteinyl-tRNA synthetase 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGGTCTCC ACCTCTACGA CACGCGCAGG CGGCGTGTGC GGCCATTCGA GCCGCTACGC 
CCCGGTCACG TGGGCGTCTA CGTGTGTGGG CCCACCGTGC AGGCGGCCCC GCACGTGGGC
CACATCCGCA CGGCGCTGCC CTTCGACCTG CTGCGCCGGT GGCTGGTGCA GTCCGGCCGT
TCGGTGACCT TCGTGCAGAA CGTCACCGAC ATCGACGACA AAATCATCAT CAACGCGGAT
CGGGACGGCA CCTCGGTCTG GGAGCTCGCC ACCCGGCAGA CCCGCGCGTT CGACGACGCC
TACCGCACTC TCGGGATCCT GCCCCCGACG ATCCAGCCGC GCGCCACCGG TCACATCCCC
GAGATGATCG CGCTGGTGTC GGCCCTCGTG GAGGGGGGCT ACGCGTACGC CAGCGGCGGT
TCGGTCTGGT TCCGGGTGGG TGCGTTCGCC GACTACGGAG CACTGTCCCA CCAGCGACCC
GACGCGATGC AACCGTCCGT GGAAGCCGAG CCCGGCAAGG CCGACCCCCG CGACTTCGCG
TTGTGGAAGG CAGCCAGGCC CGGTGAACCG TTCTGGTCGT CCCCCTGGGG TGACGGCCGG
CCGGGCTGGC ATCTGGAATG CTCTGCGATG GCCGGAAAAT ACCTCGGCCC GGTCTTCGAC
ATCCACGGCG GCGGACTCGA CCTGGTTTTC CCGCACCACG AGAACGAACG GGCCCAGACG
GTCTGCGCCG CCACGGCCCG CCCGGCGTCG AACGCGGCTT CGGCCGATTC GCCCGGCCCC
GGTGGTGGCG AGCCCGGTGG TGGCGAGCCC AGTAGCGGCG AGATGGCGCG CTACTGGATG
CACGTCGGCC TGCTGACGAC CGGCGGGACG AAGATGTCGA AGTCGTTGGG AAATTCCGTC
CTGGTGGCCG ACGCCCTCGA CGCGGTCCGT CCGCAGGTTC TGCGTTATCA CCTGCTTTCC
GCGCACTACC GGTCCACGCT CGAGTACAGC GCCGAGGCAC TGGCGGAATC CACCGCGGCC
CACGACCGGG TGGAGACCTT TGTCCGCAAC GCGTTGGACA TCCTCGGCGG GCCCGGGGAG
GCGGCGGCAC TGGCGGCCGA CGAGGTCGTC TCGTCGGTGG CAGGGGCGCA GCCGACGGTG
GCCGGAGCCC GGCCGGTGCC GGTTCCCGGG CCGGGTGGTG AGTCGAGGCT CACTCCGCGG
CGGGCCTGGT CCGATTTCAC CATCGCCATG GACGATGATC TCGCCGTCGG GCGTGCGCTG
GCGGCCCTGT TCGGCGCGGT GAGTCAGGGA AACCAGGTGC TGTCGAAGGC CCACAGCCGG
GAGCTGGCCG GGTGGGTGGA CGTGACCCGC CGGATGCTGA ACATCTTCGG TCTCGACCCG
CATGAGCAGT GGCCCACCGC AGGTGCCGAG TTCAGGCCGG CGCTGGACGG CGCGATGCAG
GTCGTGCTGG ATCTGAGGTC CGCGGCCCGG GCCCGACGGG ACTACGCGGA AGCCGACGCC
ATCCGGTCGA GACTGGCAGC CGCCGGCCTG ATCGTGGAGG ACACTCCAGA GGGCCAGCGC
TGGCATCTGG CCTGA
 
Protein sequence
MGLHLYDTRR RRVRPFEPLR PGHVGVYVCG PTVQAAPHVG HIRTALPFDL LRRWLVQSGR 
SVTFVQNVTD IDDKIIINAD RDGTSVWELA TRQTRAFDDA YRTLGILPPT IQPRATGHIP
EMIALVSALV EGGYAYASGG SVWFRVGAFA DYGALSHQRP DAMQPSVEAE PGKADPRDFA
LWKAARPGEP FWSSPWGDGR PGWHLECSAM AGKYLGPVFD IHGGGLDLVF PHHENERAQT
VCAATARPAS NAASADSPGP GGGEPGGGEP SSGEMARYWM HVGLLTTGGT KMSKSLGNSV
LVADALDAVR PQVLRYHLLS AHYRSTLEYS AEALAESTAA HDRVETFVRN ALDILGGPGE
AAALAADEVV SSVAGAQPTV AGARPVPVPG PGGESRLTPR RAWSDFTIAM DDDLAVGRAL
AALFGAVSQG NQVLSKAHSR ELAGWVDVTR RMLNIFGLDP HEQWPTAGAE FRPALDGAMQ
VVLDLRSAAR ARRDYAEADA IRSRLAAAGL IVEDTPEGQR WHLA