Gene Franean1_3354 details

Gene Information

Locus tag: Franean1_3354
Symbol:
ID: 5671725
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 3970817
End bp: 3972028
Gene Length: 1212 bp
Protein Length: 403 aa
Translation table: 11
GC content: 70%
IMG OID: 641242242
Product: DNA (cytosine-5-)-methyltransferase
Protein accession: YP_001507662
Protein GI: 158315154
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID:
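
The coordinates and lengths listed above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive positions and that the gene length includes the stop codon; neither convention is stated on this page, and the function names are illustrative only.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS whose length includes one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length_bp(3970817, 3972028)
print(length)                     # 1212
print(protein_length_aa(length))  # 403
```

The length calculation does not depend on the strand, which is blank in this record.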


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGTCGTGT CCAGGTTCCG GTTGTATCCC GACGCGGCGC AGGAAGAGGC TCTGCTGGTG 
CACTGTGGGC ACGCCCGGTT CGTGTGGAAC CTCGCGGTCG AGCAACAGTC GTGGTACCGG
CTGTGGTGTG GTCGGGCGCC AGGTTATGTG GAGCAGAACC GGCAGTTGAC CGAAGCCCGG
TCGGATAATC CATGGCTGGC GGCGGGCAGT GTCATCGTGC AGCAGCAGGC TTTGCGTGAC
TTCGCGACGG CGATGGCGAA CTTCTTCCGC GGTTCGCATC GCAGGCCCAC CTTCCGTAGG
CGTGGGCGTG GTGAGGGGTT CCGGATTGTG GCGGTGAGAC CGGGCGACGT CCGGCGGGTG
AATCGTCGGT GGGCGTGGGT GCGTGTCCCG AAGGTGGGCT GGGTGCGGTT TCGCTGGACT
CGTGAGGTGT CGGGTGCGCG GTCGTATCGG GTGACGCGGG ATCGTGCGGG CCGTTGGCAT
GTCGCGTTCG CCGTGGCCCC GAATCCGATT CCCGCGCCGG GAACCGAGAA GGTTGTCGGT
GTGGACCGTG GGGTGGTGGT GTCGGCGGCG CTGTCGACCG GGGAGCTGCT GTCCTGTCCC
GGTCTGCGAG CCGGGGAGCA GGGGCGGCTG GTCCGGTTGC AGCGCCGATT GTCGAGGGCC
AGGCGTGGGT CGCGGCGGCG CGGGCGCGTC AAGGCCCGGA TCGCACGGCT GCGTGCCCGG
GAGGTTGACC GGCGCAAGGA CTGGGTCGAG AAGACCAGCA CGGATCTCGC TCGTCGGTTC
GACGTGATCC GGGGCGAGGA CCTGAAGATC AGGGGGATGA CCCGCTCTGC CCGGGGCACC
GTCGAGGCGC CGGGAAGCAA CGTCCGGCAG AAGGCCGGGT TGAACCGGGG CATCCTCGCC
CACGGTTGGG GTCTGCTCGT CGCACGGTTG GAGCAGAAGG CCCCCGGCCG GGTGGAGAAA
GTCCCCGCCG CGTACACGAG CCAGCGTTGC TCGGCCTGCG GGCATAGGGC GCCCGGGAAC
CGCGAGAGCC AAGCGGTCTT CCGGTGCCTG GCCTGCGGGC ACACGGCCAA CGCCGACGTC
AACGCGGCTA TGAACATCGC GGTTGGGAAC ATCGCGGCCG GACGGGCCGT GACCGCGCGG
GGAGGCACGG CGCTGGCCGT GCCCGCGAAC CGCGAACCTC AACACCGCGT ACCCCTTCCG
GTGGGTGTGT AG
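
The GC content reported for this gene can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming GC content is defined as the fraction of G and C among the unambiguous bases; how the page rounds the value is not stated. The function name `gc_fraction` is hypothetical.

```python
def gc_fraction(seq: str) -> float:
    """Fraction of G/C among A, C, G, T; spaces and line breaks are ignored."""
    bases = [b for b in seq.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("sequence contains no A/C/G/T bases")
    gc_count = sum(1 for b in bases if b in "GC")
    return gc_count / len(bases)


# First 10-base block of the gene sequence: 5 G + 1 C out of 10 bases.
print(gc_fraction("GTGGTCGTGT"))  # 0.6
```

Passing the full gene sequence (with its spaces and line breaks left in) gives the value for the whole gene.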
 
Protein sequence
MVVSRFRLYP DAAQEEALLV HCGHARFVWN LAVEQQSWYR LWCGRAPGYV EQNRQLTEAR 
SDNPWLAAGS VIVQQQALRD FATAMANFFR GSHRRPTFRR RGRGEGFRIV AVRPGDVRRV
NRRWAWVRVP KVGWVRFRWT REVSGARSYR VTRDRAGRWH VAFAVAPNPI PAPGTEKVVG
VDRGVVVSAA LSTGELLSCP GLRAGEQGRL VRLQRRLSRA RRGSRRRGRV KARIARLRAR
EVDRRKDWVE KTSTDLARRF DVIRGEDLKI RGMTRSARGT VEAPGSNVRQ KAGLNRGILA
HGWGLLVARL EQKAPGRVEK VPAAYTSQRC SACGHRAPGN RESQAVFRCL ACGHTANADV
NAAMNIAVGN IAAGRAVTAR GGTALAVPAN REPQHRVPLP VGV
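
The protein sequence follows from the gene sequence under translation table 11, where a GTG initiation codon is read as methionine. The sketch below is an illustration under stated assumptions, not the database's own pipeline: the codon table is built from the 64-letter amino acid string for table 11 (codon assignments identical to the standard code), the first codon is read as M when it is one of the table 11 initiation codons, and translation stops at the first stop codon. The name `translate_cds` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for the 64 codons, enumerated in T, C, A, G order at each position.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Initiation codons of translation table 11; at the start of a CDS they encode Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(seq: str) -> str:
    """Translate a CDS from its first base, ignoring spaces and line breaks."""
    dna = "".join(b for b in seq.upper() if b in "ACGT")
    protein = []
    for pos in range(0, len(dna) - 2, 3):
        codon = dna[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":  # stop codon ends the protein
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence give the first 10 residues of the protein.
print(translate_cds("GTGGTCGTGT CCAGGTTCCG GTTGTATCCC"))  # MVVSRFRLYP
```

The final 12 bases of the gene, GTGGGTGTGT AG, read as the codons GTG GGT GTG TAG, which gives the C-terminal residues VGV followed by the stop codon.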