Gene Franean1_1550 details


Gene Information

Locus tag: Franean1_1550
Symbol:
ID: 5669953
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1852692
End bp: 1853897
Gene Length: 1206 bp
Protein Length: 401 aa
Translation table: 11
GC content: 70%
IMG OID: 641240469
Product: DNA (cytosine-5-)-methyltransferase
Protein accession: YP_001505895
Protein GI: 158313387
COG category: [L] Replication, recombination and repair
COG ID: [COG0675] Transposase and inactivated derivatives
TIGRFAM ID: [TIGR01766] transposase, IS605 OrfB family, central region
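
The coordinate and length fields above are related by simple arithmetic, which can be checked with a minimal sketch. It assumes that the start and end coordinates are 1-based and inclusive, and that the coding sequence ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and chosen only for illustration.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_length: int) -> int:
    """Protein length for an unspliced CDS, excluding one stop codon (assumed)."""
    if gene_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length // 3 - 1


# Values taken from the record above.
length = gene_length_bp(1852692, 1853897)
print(length)                     # 1206
print(protein_length_aa(length))  # 401
```

Both results agree with the Gene Length and Protein Length fields of the record.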


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.734191
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.524569
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCGAGGT TTCGGTTGTA TCCCGATGCG GTGCAGGAAC AGGCCCTGTT GGTGCACTGT 
GGGCATGCTC GGTTCGTGTG GAATCTCGCG GTCGAGCAGC AGTCGTGGTA TCGGCCATGG
CGGGGGCGGG CGCCGGGCTA TGCGGAGCAG AACCGGCAGT TGACCGAGGC CCGGTCGGCC
AGTCCGTGGC TGGCGGCGGG CAGTGTCGTC GTGCAGCAGC AGGCTTTGCG TGACTTCGCG
ACGGCGATGG GGAACTTCTT CCGTGGTTCG CATCGCAGAC CCACTTTCCG GAGGCGTGGC
CATCACGAGG GGTTCCGGAT TGTGGCGGTG AAACCGGGCG ACGTGCGGCG GGTGAATCGC
CGGTGGGCGC GGGTGCGTGT CCCGAAGGTG GGCTGGGTGA GGTTCCGCTG GTCCCGTGCT
GTGCCGGGCG CGCGGTCGTA TCGGGTGACG CGGGATCGTG CGGGCCGCTG GCATGTGGCG
TTCGCCGTGA CCCCCGCTCC GATCCCCGCG CCGGGCACCC ATGCGGTCGT CGGGGTGGAC
CGTGGGGTGG TTGTGTCGGC GGCGCTGTCG ACCGGGGAAC TGCTGTCCTG TCCCGGTCTG
AGAGCTGGGG AGCGGGCGCG GCTGGTCCGG TTGCAACGCC GGTTGTCCAG GGCCAGGTGT
GGGTCCCAGC GGCGCCAGCG CCTCAAGGTG CGGATCGCAC GATCGCGGGC CCGGGAGGTT
GACCGGCGCA AGGACTGGGT CGAGAAGACC AGCACCGACC TCGCCCGCCG GTTCGACGTG
ATCCGCGTCG AGGACCTGAA GATCAGGCGG ATGACCCGCT CGGCTCGGGG CACCGTCGAG
GCGCCGGGAA GCAATGTCCG GCAGAAAGCC GGATTGAACC GGGGCATCCT CGCCCAGGGC
TGGGGTCTGC TCGTCCGCCG GTTGGAGGAG AAGGCCCCCG GCCGGGTCGA GAAGGTCCCC
GCCGCGTACA CGAGTCAGCG TTGTTCGGCC TGCGGGCAGG TGGCGTCCGG GAACCGTGAG
AGCCAAGCGG TCTTCTGGTG CGTGGTCTGC GGGCACACGG CCAACGCCGA CGTCAACGCG
GCGGTGAACA TCGCGGTTGG GTACATCGCG GCTGGACGGG CCGTGACCGC GCGGGGAGGC
GCGGCATTGG CCGGGCCCGT GAACCGCGAA CCTCAACACT GCGCACCTCT TCTGGTGGGT
GTGTAG
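
The GC content field of the record can be recomputed from a gene sequence such as the one above. The following is a minimal sketch, assuming GC content means the percentage of G and C bases among all bases, with the whitespace used to group the sequence into blocks of ten ignored. The function name `gc_percent` is hypothetical.

```python
def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in a DNA string, ignoring whitespace."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc_count = seq.count("G") + seq.count("C")
    return 100.0 * gc_count / len(seq)


# First block of ten bases from the gene sequence above.
print(gc_percent("GTGTCGAGGT"))  # 60.0
```

Passing the full gene sequence, pasted as a multi-line string, gives a value to compare with the reported GC content.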
 
Protein sequence
MSRFRLYPDA VQEQALLVHC GHARFVWNLA VEQQSWYRPW RGRAPGYAEQ NRQLTEARSA 
SPWLAAGSVV VQQQALRDFA TAMGNFFRGS HRRPTFRRRG HHEGFRIVAV KPGDVRRVNR
RWARVRVPKV GWVRFRWSRA VPGARSYRVT RDRAGRWHVA FAVTPAPIPA PGTHAVVGVD
RGVVVSAALS TGELLSCPGL RAGERARLVR LQRRLSRARC GSQRRQRLKV RIARSRAREV
DRRKDWVEKT STDLARRFDV IRVEDLKIRR MTRSARGTVE APGSNVRQKA GLNRGILAQG
WGLLVRRLEE KAPGRVEKVP AAYTSQRCSA CGQVASGNRE SQAVFWCVVC GHTANADVNA
AVNIAVGYIA AGRAVTARGG AALAGPVNRE PQHCAPLLVG V
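
The protein sequence above corresponds to the gene sequence translated with translation table 11 (the bacterial, archaeal and plant plastid code). Note that the gene starts with GTG, which encodes methionine when used as a start codon and valine elsewhere. The following is a minimal sketch of that translation, not the database's own pipeline. The codon table and the set of start codons are written from the NCBI definition of table 11 as recalled here and should be checked against the official listing; the function name `translate_cds` is hypothetical.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (same assignments as the standard code).
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Start codons assumed for table 11; a start codon is always read as M.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a complete CDS, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    protein = []
    for pos in range(0, len(seq), 3):
        codon = seq[pos:pos + 3]
        if pos == 0 and codon in START_CODONS:
            protein.append("M")  # alternative start codons still encode Met
            continue
        amino_acid = CODON_TABLE[codon]
        if amino_acid == "*":
            break  # stop codon ends translation
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGTCGAGGT TTCGGTTGTA TCCCGATGCG"))  # MSRFRLYPDA
```

The output matches the first ten residues of the protein sequence in the record. An internal GTG, such as the last codon before the TAG stop, is translated as valine.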