Gene Francci3_3540 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3540 
Symbol 
ID3904479 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4228558 
End bp4230129 
Gene Length1572 bp 
Protein Length523 aa 
Translation table11 
GC content74% 
IMG OID637880861 
Producthypothetical protein 
Protein accessionYP_482621 
Protein GI86742221 
COG category[J] Translation, ribosomal structure and biogenesis 
COG ID[COG0621] 2-methylthioadenine synthetase 
TIGRFAM ID[TIGR00089] RNA modification enzyme, MiaB family
[TIGR01125] MiaB-like tRNA modifying enzyme YliG, TIGR01125 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.309547 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.223285 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCACAC GCCTCCATCG CCGGGTAGCT CTGATCACGC TCGGTTGTTC CCGGAACGAG 
GTGGATTCGG AGGAGCTGGC CGCCCGGCTC GGCGCGGACG GGTGGGAGCT CGTCTCCGAC
GCCGCCGACG CCGACGCCGT GCTCGTCAAC ACCTGCGGAT TCGTCGACGC GGCGAAGAAG
GACTCCATCG ACGCTCTGCT TGCGGCGGAC GGCCTGCGGG CCGGTGGCGG TCCGTCCGGT
CCCGCGGACG GCGCCGGCCC GGGTCCGCGC GCGGTGGTGG CCGTGGGCTG CCTGGCCGAG
CGTTACGGAA CCGAGCTCGC CGAGAGCCTG CCCGAGGCGG ACGCGGTCCT CGGATTCGAT
GCCTATCCGA ACATCGCCAC GCACCTCGCG GCCGTGCTGG CCGGTACCCC GGTGCCCGCG
CACTCCCCGC GTGACCGGCG GACGATGCTG CCGATCACCC CGGTGGACCG GGCCGCCCCC
GCTCTCCCGC CGGCCGCGGT GTCCACCGGA GCCGTTCCGC TGCGCCGGCG GTTGACGGCC
GGGCCGGTCG CCGTCCTGAA GATCTCCAGC GGGTGTGACC GGCGCTGCGC CTTCTGCGCG
ATCCCGTCGT TCCGCGGCTC CCACGTCTCG CGGTCACCCG ACGACGTGCT CGCCGAGGCC
GAATGGCTGG CCGGTCAGGG TGCCCGCGAG CTGGTCCTGG TCAGCGAGAA CTCCACCTCC
TACGGCAAGG ACCTCGGGGA TCTGCGCGCG CTGGAGAAGC TGCTGCCGCA ACTCGCGGCG
GTGTCGGGCA TCGTCCGGGT GCGCACGGTC TACCTGCAGC CCGCCGAGAT GCGTCCGTCG
CTGCTCGAGG TGCTGCTGAC CACTCCGGGC CTCGCGCCCT ATCTCGATCT GTCCTTCCAG
CACGCGAGCC CGCCGGTGCT GCGCCGGATG CGCCGCTTCG GTGGCAGTGG GCACTTCCTC
GACCTGCTCG CCCGCGCCCG CGCGCTGGCG CCCGAGCTGG GGGCGCGATC CAACGTCATC
GTCGGGTTCC CGGGGGAGAC CCCCGAGGAT GTCGACATCC TCGCGGAGTT CCTCGAGGCC
GCCGAGCTCG ACGCCGTCGG CGTGTTCGGC TACTCCGACG AGGAGGGTAC CGAGGCCGCC
GGGCTGACAG ACAAGATCCC CGATGAGCTG ATCGAGCGCC GCCGGGTACG CGTCACCGAC
CTCGTCGAGC AGCTGACCGC CGCCCGGGCG GACGCGCGGA TCGGCTCCCG GGTACAGGTC
CTGGTGGAGG AGGTCGCCGG TGGTCTGGCC ACCGGCTGCG CCGCCCACCA GCAGGCGGAG
GTCGACGGCG GGTGCGTCGT GCGACTGTCC CCGGGCGGCG CGGCGGATGA CGGGCCGGAG
CGCCTCGGGG TGGGGGATCT CGTCGGGGTG GGGGATCTCG TCGAGGCCCG GGTGGTCGCC
ACCGAGGGTG TCGATCTGAT CGCGGAGTTC ATCGCGGTGC TCGATCGGGC CCGTCCCACG
GCTGCGGTCG CGCGGCCGAC GCCGGACCGG GCGGCGGCCC TCGTGGGCCG GGGCGTCGCG
GATGGGACGT GA
 
Protein sequence
MSTRLHRRVA LITLGCSRNE VDSEELAARL GADGWELVSD AADADAVLVN TCGFVDAAKK 
DSIDALLAAD GLRAGGGPSG PADGAGPGPR AVVAVGCLAE RYGTELAESL PEADAVLGFD
AYPNIATHLA AVLAGTPVPA HSPRDRRTML PITPVDRAAP ALPPAAVSTG AVPLRRRLTA
GPVAVLKISS GCDRRCAFCA IPSFRGSHVS RSPDDVLAEA EWLAGQGARE LVLVSENSTS
YGKDLGDLRA LEKLLPQLAA VSGIVRVRTV YLQPAEMRPS LLEVLLTTPG LAPYLDLSFQ
HASPPVLRRM RRFGGSGHFL DLLARARALA PELGARSNVI VGFPGETPED VDILAEFLEA
AELDAVGVFG YSDEEGTEAA GLTDKIPDEL IERRRVRVTD LVEQLTAARA DARIGSRVQV
LVEEVAGGLA TGCAAHQQAE VDGGCVVRLS PGGAADDGPE RLGVGDLVGV GDLVEARVVA
TEGVDLIAEF IAVLDRARPT AAVARPTPDR AAALVGRGVA DGT