Gene Franean1_1280 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1280 
Symbol 
ID5669693 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1542776 
End bp1543987 
Gene Length1212 bp 
Protein Length403 aa 
Translation table11 
GC content70% 
IMG OID641240212 
ProductDNA (cytosine-5-)-methyltransferase 
Protein accessionYP_001505640 
Protein GI158313132 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.138792 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCTGTGT CGAGGTTTCG GTTGTATCCC GATGCGGTGC AGGAACAGGC CCTGTTGGTG 
CACTGTGGGC ATGCTCGGTT CGTGTGGAAC CTCGCGTGTG AGCAGCAGTC GTGGTACCGG
CCGTGGCGTG GCCGGGCGCC GGGTTATGCG GAGCAGAACC GGCAGTTGAC CGAGGCCCGG
TCGGGCAATC CGTGGCTGGC GGCGGGCAGT GTCATCGTGC AGCAGCAGGC TTTGCGTGAC
TTCGCGACGG CGATGGCGAA CTTCTTCCGC GGTTCGCATC GCAGGCCCAC TTTCCGTAGG
CGTGGGCGTG GTGAGGGGTT CCGGATCGTG GCGGTGAAAC CGGGCGACGT CCGGCGGGTG
AATCGCCGGT GGGCGCGGGT GCGTGTCCCG AAGGCGGGCT GGGTGAGGTT CCGCTGGTCC
CGTGCTGTGC CGGGCGCGAG GTCGTATCGG GTGACGCGGG ATCGTGCGGG CCGCTGGCAT
GTGGCGTTCG CCGTGGCCCC GCATCCGATC CCCGCGCCGG GTACGGGCGG GGTTGTCGGG
GTGGACCGTG GGGTGGTCGT GTCGGCGGCA CTGTCGACGG GGGAGCTGCT GTCCTGCCCC
GGTCTGCGAG CCGGGGAGCG GGCGCGGCTG GTCCGGTTGC AGCGCCGGCT GTCGAGGGCC
AGGCGTGGGT CGCAGCGGCG CGGGCGCCTC AAGGCCCGGA TCGCACGGCT GCGTGCCCGG
GAGGTTGACC GGCGCAAGGA CTGGGTCGAG AAGACCAGTA CCGATCTTGC CCGCCGGTTC
GACGTGATCC GCGTCGAGGA TCTGAAGATC GGGCAGATGA CCCGCTCTGC GCGGGGCACC
GTCGAGGCGC CGGGAAGCAA CGTGCGGCAG AAAGCCGGGT TGAACCGGGG CATTCTGGCT
AACGGTTGGG GTCTGCTCGT CGCGCGGTTG GAGGAGAAGG CCCCCGGCCG GGTGGAGAAG
GTCCCTGCCG CGTACACGAG TCAGCGTTGC AGTGCCTGCG GGCATGTGGC GTCCGGGAAC
CGTGAGAGCC AAGCGGTCTT CTGGTGCGTC GCCTGCGGGT ACACGGCCAA CGCCGACGTG
AACGCGGCGG TGAACATCGC GGTTGGGAAC ATCGCGGCCG GACGGGCCGT GACTGCGCGG
GGAGGCGCGG CGTTGGCCGT GCCCGTGAAC CGCGAACCTC AACACTGCGC ACCCCTTCTG
GTGGGTGTGT AG
 
Protein sequence
MAVSRFRLYP DAVQEQALLV HCGHARFVWN LACEQQSWYR PWRGRAPGYA EQNRQLTEAR 
SGNPWLAAGS VIVQQQALRD FATAMANFFR GSHRRPTFRR RGRGEGFRIV AVKPGDVRRV
NRRWARVRVP KAGWVRFRWS RAVPGARSYR VTRDRAGRWH VAFAVAPHPI PAPGTGGVVG
VDRGVVVSAA LSTGELLSCP GLRAGERARL VRLQRRLSRA RRGSQRRGRL KARIARLRAR
EVDRRKDWVE KTSTDLARRF DVIRVEDLKI GQMTRSARGT VEAPGSNVRQ KAGLNRGILA
NGWGLLVARL EEKAPGRVEK VPAAYTSQRC SACGHVASGN RESQAVFWCV ACGYTANADV
NAAVNIAVGN IAAGRAVTAR GGAALAVPVN REPQHCAPLL VGV