Gene Franean1_1215 details


Gene Information

Locus tag: Franean1_1215
Symbol:
ID: 5669628
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1449976
End bp: 1451175
Gene Length: 1200 bp
Protein Length: 399 aa
Translation table: 11
GC content: 71%
IMG OID: 641240147
Product: DNA (cytosine-5-)-methyltransferase
Protein accession: YP_001505575
Protein GI: 158313067
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
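
The coordinate and length fields above can be cross-checked with simple arithmetic. The Python sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. Neither assumption is stated in the record, and the function names are hypothetical.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1449976
END_BP = 1451175
GENE_LENGTH_BP = 1200
PROTEIN_LENGTH_AA = 399


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    # Both comparisons should hold if the record is internally consistent.
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions the 1200 bp span corresponds to 400 codons, that is, 399 residues plus the stop codon.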


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0567342
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0860408
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGAGGCGGG CGTTCAAGTT CCTGCTCCGC CCGACGGCGC GGCAGGCCAC CGCGCTGACG 
GCGATGATCG ATGATCATCG GGCGCTCTAC AACGCCGCGT TGCAGGAACG ACGCGACGCC
TACCGGCATC CGTCGAAGGC GACGGTTCGC TACGGCGACC AGTCCGCCCA GCTCAAGGAG
ATCCGCGCCT GCGACCCGGA TCAGGGCCGC TGGTCGTTCT CCTCCCAGCA GGCCACCCTG
CGTCGCCTCG ACAAGGCGTT CGCCGGCTTC TTCCGCCGCG TCAAAGCAGG CGAGACCCCT
GGCTACCCGC GGTTCAGAGG CGCGGGCCGG TTCGACACGG TCGAGTGGCC GAGGGACGGG
GACGGCTGCC GCTGGAACTC CCAGCCTGAG CATCCCACCC GGACCCGGGT CCGGCTTCAA
GGTGTCGGTC ACGTCAAGGT TCACCAGCAC CGGCCGGTGG CGGGCACGGT CAAGACGGTC
TCGGTGAGGC GGGAAGGCCG CCGCTGGTAT GTGGTCCTCT CCTGCGACGA CGTGCCCGCG
CGGCCGCTGC CGGCCACCGG GGTGGTGGTG GGGGTGGATA TGGGTGTGGC GTCGCTGGTG
ACCCTCTCGG ATGGCCGTCA GGTCGGTAAC CCGCGTTTTC TTGCCGCGGC GGCCGGTCGG
CTCGCGCGTG CGCAACGGGA ACTGGCCCGT AAGAAGCGGG GGTCGACCCG GCGCCGGAAG
ACCGTCGCGA AGGTCGCCGC CCTGCACCGC AGGGTTCGCC GGCAGCGGCT CGACCTCGCC
CACACGGTCG CACGCGACCT GGTCCGCGAC CACGATCTGA TCGCCGTGGA GGCACTGCGG
ATCGTGAACA TGACCCGCCG GGCCGTGCCG AGACCCGACC CCGACCGGCC CGGAGCTTTC
CTGGCGAACG GGCAGGCGGC GAAGTCCGGA TTGAACAGGA GCGTTCTCGA CGCGGGGTGG
GGGGTGTTCC TCGCCGTGCT GCGTGCCAAG GCTGAAAGTG CCGGACGGGT GGTCGTCGAG
GTGAACCCCG CCAACACCTC CCGCACGTGT GCGGTCTGCG GGCACTGCCA CGCCGACAAC
CGCAGGACAC AGGCCGCGTT CGTCTGTGTC GCGTGCGGGC ATGCCGCGCA CGCCGACGTG
AACGCGGCGA TCAACATTCT TCGGGTCGGG CTGGCCCGTC AGGGCGCGGA AGCGGCCTGA
 
Protein sequence
MRRAFKFLLR PTARQATALT AMIDDHRALY NAALQERRDA YRHPSKATVR YGDQSAQLKE 
IRACDPDQGR WSFSSQQATL RRLDKAFAGF FRRVKAGETP GYPRFRGAGR FDTVEWPRDG
DGCRWNSQPE HPTRTRVRLQ GVGHVKVHQH RPVAGTVKTV SVRREGRRWY VVLSCDDVPA
RPLPATGVVV GVDMGVASLV TLSDGRQVGN PRFLAAAAGR LARAQRELAR KKRGSTRRRK
TVAKVAALHR RVRRQRLDLA HTVARDLVRD HDLIAVEALR IVNMTRRAVP RPDPDRPGAF
LANGQAAKSG LNRSVLDAGW GVFLAVLRAK AESAGRVVVE VNPANTSRTC AVCGHCHADN
RRTQAAFVCV ACGHAAHADV NAAINILRVG LARQGAEAA
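
The relationship between the gene and protein sequences can be illustrated with a short Python sketch. It rests on stated assumptions rather than on the database's own pipeline: translation table 11 is taken to share the standard codon-to-amino-acid assignments, and the first codon (here GTG) is treated as an initiator and written as M. The names `translate_cds`, `gc_fraction`, and `GENE_PREFIX` are hypothetical, and only the first 60 bases of the gene sequence are used.

```python
import itertools

# Standard codon assignments in TCAG order; assumed to apply to table 11.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(itertools.product(BASES, repeat=3), AMINO_ACIDS)
}

# First line of the gene sequence in this record (60 bases).
GENE_PREFIX = (
    "GTGAGGCGGG CGTTCAAGTT CCTGCTCCGC CCGACGGCGC GGCAGGCCAC CGCGCTGACG"
)


def _clean(dna: str) -> str:
    """Remove the spacing used in the record and normalise case."""
    return "".join(dna.split()).upper()


def translate_cds(dna: str) -> str:
    """Translate a CDS, stopping at the first stop codon.

    Assumption: the first codon is an initiator and is written as 'M'.
    """
    seq = _clean(dna)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = "M" if i == 0 else CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    seq = _clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


if __name__ == "__main__":
    print(translate_cds(GENE_PREFIX))
    print(round(gc_fraction(GENE_PREFIX), 2))
```

Translating the 60-base prefix yields MRRAFKFLLRPTARQATALT, which matches the first 20 residues of the protein sequence listed above. Passing the full gene sequence to the same functions would allow the protein length and GC content fields to be checked in the same way.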