Gene Franean1_0653 details

Gene Information

Locus tag: Franean1_0653
Symbol:
ID: 5669070
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 762954
End bp: 764069
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 75%
IMG OID: 641239580
Product: cysteine--tRNA ligase
Protein accession: YP_001505018
Protein GI: 158312510
COG category: [J] Translation, ribosomal structure and biogenesis
COG ID: [COG0215] Cysteinyl-tRNA synthetase
TIGRFAM ID:
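
The coordinate and length fields above are related by simple arithmetic. The following is a minimal sketch of that relationship, assuming the start and end positions are 1-based and inclusive and that the final codon of the CDS is a stop codon; the record does not state either convention, and the function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Protein length in aa, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(762954, 764069)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1116 371
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields of the record.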


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.582134
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCGCCCCG TGCTGCGGCT CGGCGGGGCG CCGCTGCCAG TGGTGGGCCG AGCCCGGGTC 
TACGTGTGCG GCATCACCCC GTACGCCGTC ACCCACCTCG GCCACGCGGC CACCTATCTG
TGGACGGACC TGGCGATCCG GGTGTGGCGC AACGTCGGGG TCCCGGTCGA GCTGGCCCGC
AACATCACTG ACGTCGACGA CGCGATGTTC GACGAGGCCC GCCGGACCGG CCTGCCCTTC
GACCAGATCG CGTCGCTGCA GCGGTTCGCC TTCGACCGCA CGATGACCTC ACTCGGCATC
CGCCCACCCG ACCACGAGCC GACCGCGCGC GCGGCCGTGA CCCGGGTGAT CGAGCTGGCG
ACGGCGCTGC TGCGCGCCGG CCACGCCTAC GAGCGCGGCG GCAGCGTCTA CGCCCGGACA
GCCGAGGCCG CCGAACGCGC CGGCCTCGAC CGGGCAGCCG CGATCGCCCT CGCCGCCGAG
TACAACGACG ACCCGCACGA CCCCGAACGG GACGACCCAC TCGACGTCGC CGTCTGGCGC
GCCGCCCGGC CGGACGGCGG GTACCCGAGC TGGCCCAGCC CGTGGGGCCC CGGCCGGCCC
GGCTGGCACG CCGAGTGCGC GGCGATGGTC CTGTCGACCT TCGGCTCCAG CGTGGACCTG
CACGCCGGCG GCGCCGACCT GCGCTACCCC CACCACGCCG TGGAGGCGCT GCTCGCCGAG
CGCGCCACCG GCGTGCAACC GTTCGCGCGG GCCTGGCTGC GGCCGGGGAC CGTCCGTTCG
GGCGGGGTCA AGATGTCCAA GTCGCTGGGC AACCTGACCT TCGTCGACGA CCTGCTGACC
AGGCACAGCC CGGCCGCGGT GCGGCTGCTC TGCCTGGTGC GCCCCCGGGA CGACGACTGG
GACTTCGACG AGGCTGCGTT CGACGAGGCC GAGGCCGGCC TGGACCTCCT CTACTCCGCC
GCCGGCCGCC CCGGCGCCGT CCGGGGCGCC TCCGCCGTGG ACGAGGTCGA CGCCGCGCTG
CTCGACGATC TCGACACCGT CCGGGCCCGG TCCATCGCTC TGGACTCCGG CGGTACCGCG
GCCCGGCGGT TCATCTCCGT CCTCGGGCTC ACCTGA
 
Protein sequence
MRPVLRLGGA PLPVVGRARV YVCGITPYAV THLGHAATYL WTDLAIRVWR NVGVPVELAR 
NITDVDDAMF DEARRTGLPF DQIASLQRFA FDRTMTSLGI RPPDHEPTAR AAVTRVIELA
TALLRAGHAY ERGGSVYART AEAAERAGLD RAAAIALAAE YNDDPHDPER DDPLDVAVWR
AARPDGGYPS WPSPWGPGRP GWHAECAAMV LSTFGSSVDL HAGGADLRYP HHAVEALLAE
RATGVQPFAR AWLRPGTVRS GGVKMSKSLG NLTFVDDLLT RHSPAAVRLL CLVRPRDDDW
DFDEAAFDEA EAGLDLLYSA AGRPGAVRGA SAVDEVDAAL LDDLDTVRAR SIALDSGGTA
ARRFISVLGL T
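
The sequences above are printed in blocks of ten characters, so any check against the GC content or protein fields has to strip the whitespace first. The sketch below shows one way to do that, computing GC content and translating a nucleotide string with the amino acid assignments of translation table 11. The helper names are hypothetical, and the code does not handle the alternative start codons that table 11 permits (this gene starts with ATG, so that is not needed here).

```python
BASES = "TCAG"
# Amino acid assignments for translation table 11, codons ordered T, C, A, G.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    s = clean(seq)
    return 100.0 * (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    s = clean(seq)
    protein = []
    for pos in range(0, len(s) - 2, 3):
        amino_acid = CODON_TABLE[s[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First ten-base block of the gene sequence above.
print(gc_content("ATGCGCCCCG"))             # 80.0
# First two blocks; trailing bases that do not fill a codon are ignored.
print(translate("ATGCGCCCCG TGCTGCGGCT"))   # MRPVLR
```

Passing the full gene sequence to these helpers would let the GC content and Protein Length fields be checked against the sequence itself.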