Gene Franean1_0366 details


Gene Information

Locus tag: Franean1_0366
Symbol: cysS
ID: 5668790
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 434305
End bp: 435825
Gene Length: 1521 bp
Protein Length: 506 aa
Translation table: 11
GC content: 73%
IMG OID: 641239298
Product: cysteinyl-tRNA synthetase
Protein accession: YP_001504738
Protein GI: 158312230
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0215] Cysteinyl-tRNA synthetase
TIGRFAM ID: [TIGR00435] cysteinyl-tRNA synthetase
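
The coordinate and length fields above can be cross-checked against each other. The following is a minimal sketch, assuming the start and end coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length; the variable names are illustrative and are not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 434305
end_bp = 435825
reported_gene_length = 1521      # bp
reported_protein_length = 506    # aa

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length, codons, protein_length)
print(gene_length == reported_gene_length)
print(protein_length == reported_protein_length)
```

Under these assumptions the span is 1521 bp, which is 507 codons, or 506 residues once the stop codon is excluded, in agreement with the reported values.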


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0158714
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00296969
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGGTCTTC ACCTCTATGA CACGCGCACC CGGCGCGTGC GGCCGTTTGT GCCGCTGCGC 
CCCGGTCATG TGGGCGTGTA CGTGTGTGGG CCCACCGTCC AGTCTCCGCC GCATGTGGGC
CACATCCGTA CCGCGCTCGC GTTCGACCTG CTGCGGCGCT GGCTGACGCA GAGCGGCCTG
TCGGTGACGT TCGTCCAGAA CGTCACCGAC ATCGACGACA AGATCATCGT CAACGCCGAC
CGGGCGGGGA CGAGCGTGTG GGAGCTCGCC ACCAGGCAGA CCCGGGCGTT CGAGGACGCC
TACCGGGCCG TCGGGATCCT GCCCCCGACG ATCTCCCCGC GGGCCACCGG GCACATCCCC
GAGATGCTCG AGATGATCGC CGTCCTGGTC GAACGGGGGT TCGCCTATCC CGGCGCGGGC
TCGGTGTGGT TCCGCGTCGG CGCGTTCGCG GACTATGGTG CGCTCTCCCA TCAGCGGCCC
GCCGCGATGC ACCCGTCCCC GGAGTCCGAG CCGGGCAAGG CCGACCCCAG GGACTTCGCC
CTGTGGAAGG CCGTGAAGCC CGGTGAGCCG TCCTGGTCCT CTCCCTGGGG CCCTGGCAGG
CCGGGCTGGC ACCTGGAGTG CTCAGCGATG GTGGGCAAGT ACCTCGGCGA CGTGTTCGAC
ATCCACGGCG GCGGGCTCGA TCTGGTGTTC CCGCATCACG AGAACGAGCG GGCGCAGTCC
GTGTGCGCGG CGGAGCTGGC CGCCGCGGGC GCCGCGCCCG CGGCGGCCAG CTCCCGGGGT
GCCGTGGCGG CTGGCGCCGA CGTGCCGGGT GCCGGCGGGG CGGGCGAGAT GGCGCGCTAC
TGGATGCACG TGGGTCTGCT CACCACCGGT GGCACCAAGA TGTCGAAGTC GCTGGGCAAC
TCGTTCTTCG TGACCGACGC GCTCGCCGCG GTACGCCCCC AGGTGCTGCG CTACCACCTG
CTCTCCGCGC ACTACCGCTC CTCGCTGGAG TACAGCGCGC AGACCCTGGA AGAGTCCGCG
GCGGCGCATG ACCGGATCGA GACGTTCGTC CGCAACGCCC TGGACATCCT GGGCGGCCCG
GCGGAGGCCG CGGCGCTTGC CGCCGAGGCC GATCGCGTGG CGGGCACCGA GGCTGCGGAC
GCGGACGGGG CCTGGTCCGA GTTCGCCGCG GCGCTGGACG ACGACCTCGG CGTCGGCCGC
GCGCTGGCCG CCCTGTTCGG CGTGGTCGGC CGGGGCAACC AGGTGCTGTC GAAGACCCAC
AGCCGCGAGC TGGCTGGCTG GGTGGACGTC GCCCGCCGGA TGCTGACCGT CCTCGGGCTC
GACCCGGTCG AGCAGTGGCC GACGGCGGGC GCCGAGCTGC GTCCCGCGCT CGACGGCGTC
ATGGACGTCC TGCTGGACCT GCGCTCGGCG GCCAGGGCCC GGCGGGACTA CAGCGAGGCC
GACTCCATCC GCTCCCGGCT CGCGGCGGCC GGTGTGGTCA TCGAGGACAC GCCCGAAGGG
CAGCGCTGGC ACCTCACGTA G
 
Protein sequence
MGLHLYDTRT RRVRPFVPLR PGHVGVYVCG PTVQSPPHVG HIRTALAFDL LRRWLTQSGL 
SVTFVQNVTD IDDKIIVNAD RAGTSVWELA TRQTRAFEDA YRAVGILPPT ISPRATGHIP
EMLEMIAVLV ERGFAYPGAG SVWFRVGAFA DYGALSHQRP AAMHPSPESE PGKADPRDFA
LWKAVKPGEP SWSSPWGPGR PGWHLECSAM VGKYLGDVFD IHGGGLDLVF PHHENERAQS
VCAAELAAAG AAPAAASSRG AVAAGADVPG AGGAGEMARY WMHVGLLTTG GTKMSKSLGN
SFFVTDALAA VRPQVLRYHL LSAHYRSSLE YSAQTLEESA AAHDRIETFV RNALDILGGP
AEAAALAAEA DRVAGTEAAD ADGAWSEFAA ALDDDLGVGR ALAALFGVVG RGNQVLSKTH
SRELAGWVDV ARRMLTVLGL DPVEQWPTAG AELRPALDGV MDVLLDLRSA ARARRDYSEA
DSIRSRLAAA GVVIEDTPEG QRWHLT
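
The sequences above are listed in space-separated blocks of ten characters, so they need to be joined before any analysis. The sketch below shows one possible way to do that in plain Python and to spot-check the gene sequence against the protein sequence and the GC content field. It is an illustration, not the method used to generate this record: the helper names (clean, gc_fraction, translate) are hypothetical, and it assumes that translation table 11 assigns amino acids to codons in the same way as the standard genetic code.

```python
# Compact standard genetic code, with codons ordered T, C, A, G at each position.
# Assumption: translation table 11 uses these same amino acid assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(block_text):
    """Remove spaces and line breaks from a blocked sequence listing."""
    return "".join(block_text.split()).upper()


def gc_fraction(dna):
    """Return the fraction of G and C bases in a cleaned DNA string."""
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(dna):
    """Translate complete codons in frame 1; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


# First three blocks and the final line of the gene sequence listing.
first_blocks = clean("ATGGGTCTTC ACCTCTATGA CACGCGCACC")
last_line = clean("CAGCGCTGGC ACCTCACGTA G")

print(translate(first_blocks))  # compare with the start of the protein sequence
print(translate(last_line))     # compare with the end of the protein sequence
```

The first call gives MGLHLYDTRT and the second gives QRWHLT followed by a stop marker, matching the first and last blocks of the protein sequence. Passing the full cleaned gene sequence to gc_fraction gives a value that can be compared with the GC content field.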