Gene Francci3_2197 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_2197 
Symbol 
ID3906336 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp2569242 
End bp2570618 
Gene Length1377 bp 
Protein Length458 aa 
Translation table11 
GC content66% 
IMG OID637879529 
ProductRNA-directed DNA polymerase 
Protein accessionYP_481295 
Protein GI86740895 
COG category[L] Replication, recombination and repair 
COG ID[COG3344] Retron-type reverse transcriptase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.187035 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAGTGCCA TTGCTGCTAT TCCCGCCGCT TCGGGATCCC GAGTGGATCC GGTCCGTGTC 
TTGCAGCACA CGCTGTACCG GGCGGCCAAG GCCGATCCCG GGCGGAGGTT CCATGCGCTC
TCGGACAAGG TCCACCGTGG GGACGTCCTG GAGCGTGCGT GGGCACTGGT GCGCGCCAAC
AGGGGCGCGC CGGGCATTGA CCGGACCACC ATTGCCCAGG TCGAGGAGTA CGGGGTGTCC
CGTCTTCTCG ACGAGCTGGG CGGCGAATTG CGGGAAGGGG GGTACCGTCC TTTGCCTGTG
CGCCGGGTAG AAATTCCCAA GCCGGGGGAC AAGGGTGCGT CCCGGCCGCT GTCGATCCCT
GCCGTCCGCG ACCGTGTGGT GCAGACCGCC GTGAAGATCG TGCTCGAGCC GGTCTTCGAG
GCGGACATGG CCGGGTGCTC GTTCGGGTTC CGCCCGCGGC GCTCGGCTCA TGACGCACTG
CAAGTGCTGG TGGATGCGTG CGGGAAGGGC CGTCGGTGGG TGGTGGAGAC GGATATCGCT
GATTGCTTTA CGGCGATCCC GTTCGAAGGG TTGATGCAGA CGATCAAGGA ACGGGTCTGC
GACCAGGCGA TGCTCAGGTT GGTGCGTGCT GTCCTGCGTG CCGGCGTGAT GCGCGACGGC
GAGGTCCGGA GGCCGGTCAG CGGAATGTCC CAGGGCGGGC CGCTGTCCCC ATTACTCTGT
AACGTCTACC TGAACCGGCT TGACCGGGAA TGGGACGACG GGGACGGGAT GATGGTCCGG
TTCGCGGACG ATATCGTCGT GATGTGCTGG TCGGAGGATC AGGCGGTGCG GGCGATGGAT
TGTCTGGTCC GGTTGCTCGC TGATCTGGGG CTGGAGACCA AGGCAGCCAA GACCCGTATC
GTTCACCTAC AGGTGGGAGG GGATGGTCTT GAGTTCCTGG GCTTCCATCA CCGCCTGGTG
CTGTCGCGGG GTGACCGGGG CAGACGGAGG GTGGCGTTTC TTGCGCGCTG GCCCTCGGAC
AGGGCCATGC AGCATGCTCG GGATCGGATT CGGGAGATTA CGGGACGGTC CAGGCTGCTG
CTTAATCCAG AGGCGGTCGT GCGGGAACTG AACCTGTTCC TGCGTGGCTG GGCCGGATAT
TTCCGCTACG GATATTCGGC ACGGCGCCTT GCTATGATCA GGCGGTTCGC GCAGGTACGG
CTGGCTCGGT TTGTTCGGGC CAGACACCGG CGCAGCATGG GCTTCGGCTG GCGGGTGCTG
ATACGCTCCC GGCCGGTGGA CCTCGGCCTG GTCAGCCTGT CTGGGATTGT CGTCACTCCC
AGGGCCGGAA AGCTCTGGCG GGAGAGACCG AATGCCGGCG GTGAACGGCG TCGGTAA
 
Protein sequence
MSAIAAIPAA SGSRVDPVRV LQHTLYRAAK ADPGRRFHAL SDKVHRGDVL ERAWALVRAN 
RGAPGIDRTT IAQVEEYGVS RLLDELGGEL REGGYRPLPV RRVEIPKPGD KGASRPLSIP
AVRDRVVQTA VKIVLEPVFE ADMAGCSFGF RPRRSAHDAL QVLVDACGKG RRWVVETDIA
DCFTAIPFEG LMQTIKERVC DQAMLRLVRA VLRAGVMRDG EVRRPVSGMS QGGPLSPLLC
NVYLNRLDRE WDDGDGMMVR FADDIVVMCW SEDQAVRAMD CLVRLLADLG LETKAAKTRI
VHLQVGGDGL EFLGFHHRLV LSRGDRGRRR VAFLARWPSD RAMQHARDRI REITGRSRLL
LNPEAVVREL NLFLRGWAGY FRYGYSARRL AMIRRFAQVR LARFVRARHR RSMGFGWRVL
IRSRPVDLGL VSLSGIVVTP RAGKLWRERP NAGGERRR