Gene Francci3_2642 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2642 
Symbol 
ID3906315 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3117228 
End bp3118622 
Gene Length1395 bp 
Protein Length464 aa 
Translation table11 
GC content70% 
IMG OID637879967 
Productcysteinyl-tRNA synthetase 
Protein accessionYP_481733 
Protein GI86741333 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0215] Cysteinyl-tRNA synthetase 
TIGRFAM ID[TIGR00435] cysteinyl-tRNA synthetase
[TIGR03447] cysteine--1-D-myo-inosityl 2-amino-2-deoxy-alpha-D-glucopyranoside ligase 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value0.736924 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.302095 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTGTCGG CTACGCTGCG GTTACTCTCC TTGTGCCCGC CGGAGAGTCG GTTCGCGGTT 
ACTCTGACAC GCATGCAGGC GTGGCCATCT CCTCCGATAC GTTCTCTTCC CGGTCACGGG
AAGCCACTGA GGATCTTCGA CACGGCCACG TCGAGCGTAC GCGAGTTGGC GCCGGCCGTC
ACCGCAAGGC TGTATGTGTG TGGCATCACG CCCTATGATG CGACGCATCT GGGGCATGCC
TTCACCTACC TCACCTACGA CCTCGCGCAG CGTGTGCTGC GAGACGCCGG GCATCACGTC
CACTATGTAC AGAACGTAAC GGATGTCGAT GACCCGTTGC TTGAGCGAGC CACCCGTGAC
GGGCTGGACT GGCGGGCCCT CGCCGACCGG GAGATCGACC TGTTCCGCGA GGACATGACC
GCGCTGCGGA TGTTGGCGCC GGACGCCTAC GTCGGGGTGG TCGAGGCCAT CCCGATGATC
GTCGACATGG TGGTGGAGCT CGTCGACCGG GGCGCGGCCT ACCAGGTCGA CGACGACCTG
TACTTCTCGA TCGCCACCGC ACCCGCCTTC GGGGAGATCT CGCATCTCAG CCGGGCCGAG
ATGCTGGCGA TCTGCGCCGA GCGCGGGGGT GACCCGCGCC GGACCGGCAA GAAGGACCCC
CTCGATCCGC TGCTGTGGCG CGCCCACCGC CCCGGCGAGC CGTCCTGGCC CTCGCCGTTC
GGCCCCGGCC GGCCCGGCTG GCACATCGAG TGCTCCGCCA TCGCCCGCCA CTATCTCGGC
GGGGTCATCG ACATCCAGGG TGGCGGAACC GACCTGAGCT TTCCGCACCA CGAGTGCAGC
GCGGCGCACG CCGAGGTCGC CGCCGGCATC CGGCCGTTCG CCCGCAGCTA CGTGCACACC
GCGATGGTGA GCCTCGACGG CCACAAGATG TCGAAGTCGC GGGGCAACCT GGAGTTCGTC
TCCCGGCTGC GCCGGGCCGG GGTGGATCCG GCGGCCCTGC GGCTGGCCCT GCTCGATCAT
CGGCACACCG AGGACTGGGA GTGGACGCCG GGCCTGCTCG ACGACGCCGT GGACCGGATG
AACCGGTGGC GGGCCGCCGT CGCCCTGCCC ACCGGGCCTG ACGCCATGGG ACTGCTCGCC
GCCGTGCGTG AGCGGCTCGC CGACGACCTC GACGCCCCGG GTGCCGTCGC CGCGGTGGAC
GCCTGGGTCG GCGCCGCGCT CGCTGATGCG GGCGGCTCCG CCGGTGCGGG CCCGGATCCC
ACCCATCAGG GGGGTCCGGT TCGCGGTTCT GGCGGTGACG TGCCCGCCTG GGGGGAGGCG
CCCGCACTCG TGCGGCGCCT CGTTGACACA CTGCTGGGCG TAGACCTTGA ACCCGTCAGA
CCCAGAGGGA GCTGA
 
Protein sequence
MLSATLRLLS LCPPESRFAV TLTRMQAWPS PPIRSLPGHG KPLRIFDTAT SSVRELAPAV 
TARLYVCGIT PYDATHLGHA FTYLTYDLAQ RVLRDAGHHV HYVQNVTDVD DPLLERATRD
GLDWRALADR EIDLFREDMT ALRMLAPDAY VGVVEAIPMI VDMVVELVDR GAAYQVDDDL
YFSIATAPAF GEISHLSRAE MLAICAERGG DPRRTGKKDP LDPLLWRAHR PGEPSWPSPF
GPGRPGWHIE CSAIARHYLG GVIDIQGGGT DLSFPHHECS AAHAEVAAGI RPFARSYVHT
AMVSLDGHKM SKSRGNLEFV SRLRRAGVDP AALRLALLDH RHTEDWEWTP GLLDDAVDRM
NRWRAAVALP TGPDAMGLLA AVRERLADDL DAPGAVAAVD AWVGAALADA GGSAGAGPDP
THQGGPVRGS GGDVPAWGEA PALVRRLVDT LLGVDLEPVR PRGS