Gene Franean1_6840 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6840 
Symbol 
ID5675153 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8340332 
End bp8341522 
Gene Length1191 bp 
Protein Length396 aa 
Translation table11 
GC content68% 
IMG OID641245689 
ProductDNA (cytosine-5-)-methyltransferase 
Protein accessionYP_001511080 
Protein GI158318572 
COG category[L] Replication, recombination and repair 
COG ID[COG0675] Transposase and inactivated derivatives 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.643066 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTCCCGCT ACCGGCTCGA CCCGGCACCC GAACAGGTCG CTGGAATCGA GGAGCACTGC 
GGGCACGCGC GTTTTGTTTG GAATCTTGCG GTCGAACAGC AGTCGTGGTG GAAGCCCGGC
CAGGGAAGGG CGCCGAACCA TGCGGAGCGT TGCCGGCAGT TGACCGAAGC GCGGGCAGAG
TTCGAGTGGC TGCGAGCCGG TTCGCAGACC GTTCAGCAGC AGGCGCTTCG AGACTTCGAC
CAGGCTATAC GGAACTTCTT CAACGGCTCG CACCGCCACC CGACCTTTCG GAAACGTGGC
CGGTCCGAGG GTTTCCAGAT CGTAGGGAAG AACGCACGGG TCGAGAAGCT GAACCGGAAG
TGGTCCCAAT GCTGGATCCC GAAGGTCGGC TGGGTGAAGT TCCGCGTGTC GCGGATAATC
CCGGACTTCA GGTCGTACCG GGTGACCCGG GACCGGGCGG GCCGCTGGCA TGTGGCGTTC
GCCGTAGCAC CCGACCCGGT CCCCGCGCCA GGAACCGGTG AGGTCGGCGA GGTCGTCGGT
GTGGACCGGG GTGTCGCCGT GTCCGCGGCG CTGTCCACCG GGGAGCGGCT GTCCTGCCCG
ACGCTGAGGC CGAAGGAAGC CGAACGACTC CGCCGGCTCC AGCGCCGGCT GGCGAAAGCC
ACACGCGGGT CGAACCGGCG CGGCCGGCTG AGGACCCAGA TCGCCCGGGT GAAGGCCCGG
GAGGCCGACC GGCGGAAAGA CTGGGTGGAG AAAACCTCCA CCGATCTTGC CCGCCGGTTC
GACGTCATCC GGGTGGAAGA TCTCCGCATC ACGAACATGA CCCGGTCGGC CCGAGGCACT
GTCGAACAGC CGGGCCGGAA TGTTCGGCAG AAAGCCGGAC TGAACCGGGG CATCCTCGCC
AACGGCTGGG GTCTGCTTGC CCGGCGGTTG GAACAGAAAG CACCCGGCCG GGTGGAGAAG
ATCCCGGCCG CCTACACCAG TCAGTGTTGC TCGTCCTGCG GGCATGTGGC GCCCGGGAAC
CGCGAGAGCC AAGCGGTGTT CCGGTGCGTC GCCTGCGGAC ACACGGCCAA CGCGGACGTG
AACGCTGCAT GCAACATCGC GGCTGGACGG GCCGTGACCG CGCGGGGAGG CGCAGCATTG
GCCGCGAACC GCGAACCTCA ACACTCCACG CCTCCTCTGG TGGATGGGTA G
 
Protein sequence
MSRYRLDPAP EQVAGIEEHC GHARFVWNLA VEQQSWWKPG QGRAPNHAER CRQLTEARAE 
FEWLRAGSQT VQQQALRDFD QAIRNFFNGS HRHPTFRKRG RSEGFQIVGK NARVEKLNRK
WSQCWIPKVG WVKFRVSRII PDFRSYRVTR DRAGRWHVAF AVAPDPVPAP GTGEVGEVVG
VDRGVAVSAA LSTGERLSCP TLRPKEAERL RRLQRRLAKA TRGSNRRGRL RTQIARVKAR
EADRRKDWVE KTSTDLARRF DVIRVEDLRI TNMTRSARGT VEQPGRNVRQ KAGLNRGILA
NGWGLLARRL EQKAPGRVEK IPAAYTSQCC SSCGHVAPGN RESQAVFRCV ACGHTANADV
NAACNIAAGR AVTARGGAAL AANREPQHST PPLVDG