Gene Franean1_1000 details

Gene Information

Locus tag: Franean1_1000
Symbol:
ID: 5669414
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1180019
End bp: 1181437
Gene Length: 1419 bp
Protein Length: 472 aa
Translation table: 11
GC content: 70%
IMG OID: 641239928
Product: DNA methyltransferase
Protein accession: YP_001505362
Protein GI: 158312854
COG category:
COG ID:
TIGRFAM ID:
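
The start and end coordinates, gene length, and protein length listed above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced here for illustration.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 1180019
END_BP = 1181437


def gene_length(start: int, end: int) -> int:
    """Length in bp of a feature, assuming inclusive 1-based coordinates."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count, assuming the last codon is a stop codon (not counted)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1419
    print(protein_length(length))  # 472
```

Both results agree with the Gene Length (1419 bp) and Protein Length (472 aa) fields.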


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCATTGC GGCATGGTGA TCGTTTGGGT GTCGCGCACA GGACAGCGAC CAGGCCAGAT 
CGGCCCCAGA CCGTTCGTGC CCCCGCGCAA GCGTCCCGGG CTCCCTCGCG CATACTGCGG
AGCCCCGAGC CGGGGCTCTC CCCACGCCGC CGGCCCTCTC CTGGAGGCCA CGTGAGTACG
ACAGCCGACC GGACGTCCCA CGCGCGCAAC GAGCACATGA CATTCCGGGC GAACCGGGGT
GTAGGGCGGC ATGGCTGGCT GCGGCTCACG CCCGCCTACG GAGTCCGGCT CGTACGGGGG
CGTATCTCCC ACCTGCCCGC CGGCTCGGTC ATCACCGACC CGTTCTCCGG AACGGGGACG
ACCCCGCTCG CCGCGGCCGA GCTCGGCCAT CACGGCCAGA GCGCCGATCT CAATCCCTTC
CTCGTCTGGT TGGGGCGGGC CAAGGTCCGC CAATACCCGC GGCAGACCCT TGCGGACGCG
GCCACCGCGG CCGCCGATGC CGGGGACGCC GCCGCCCGCA TGGGCCGTGA CACCGAGTTG
TGGCAGCCGA ACATCTTCCG GATCGAGAAA TGGTGGAGCC CCGGGGCGCT GCACGCGCTG
CGCGCACTCC GGGCGGCGCT CGACGCATAT TCAGGACCGG CGGGTGATCT CCTGCAGATC
GCACTGTGTC GGGTACTGAT CTCCGTCAGC AACGCGGCCT TCAATCATCA GTCCATGTCG
TTCAAGGCCG CCGCCGGCGA GACCCGGCCC GGCTCCTTCG ACCCGGACGC GGCCGCGGCC
ACCATCGCGC TGTTCGGCAC CGAGGCGGCA GCCCTGATCG AATCCGCCCG GGTCGACCTG
CCGGGCTCCG CCGCCGTCCA CGAGGGCGAC TCGCGCTCCG TCGTGCCGGA CCTGCGCGAA
ACCGATCTCG TGCTGACCAG CCCGCCCTAC GTGAACCGGA TGTCCTACAT CCGCGAGCTC
CGGCCGTACA TGTACTGGCT GCGCTACCTC GACCGCGCCG GCGACGCCGG TGAGCTGGAC
TGGCGCGCGA TCGGTGGCAC CTGGGGGAGC GCCACGTCCA ACCTGCGCTC CTGGACGCCG
GCCACGCCCA CCCCGGTCGA CGAGGCGCTG GAGGCCGTCT GCGCGCGGAT CGCGGCCGAC
GGCGACCGGA ACGGGCCGCT GCTGGCGACC TATGTGCGGA AATACCACCA CGACATGTGG
CTGCACTTCC AGACCGTCAC ACCACTGGTG AAACGCGGCG GGCAGGTTTC CTACATCGTG
GGGAACTCGA CGTTCTACGG CCACGGTGTC CCTGCTCAGG ACTGGTATGC GTTGATGCTG
CGCGAGCTCG GCTACGCCGA CGTCGAGGTG GAGGTTATTC GCAAGCGGAA CTCAAACAAG
GCCCTGTTCG AATTCGACGT CCGGGCCCGC CGGCCTTGA
 
Protein sequence
MPLRHGDRLG VAHRTATRPD RPQTVRAPAQ ASRAPSRILR SPEPGLSPRR RPSPGGHVST 
TADRTSHARN EHMTFRANRG VGRHGWLRLT PAYGVRLVRG RISHLPAGSV ITDPFSGTGT
TPLAAAELGH HGQSADLNPF LVWLGRAKVR QYPRQTLADA ATAAADAGDA AARMGRDTEL
WQPNIFRIEK WWSPGALHAL RALRAALDAY SGPAGDLLQI ALCRVLISVS NAAFNHQSMS
FKAAAGETRP GSFDPDAAAA TIALFGTEAA ALIESARVDL PGSAAVHEGD SRSVVPDLRE
TDLVLTSPPY VNRMSYIREL RPYMYWLRYL DRAGDAGELD WRAIGGTWGS ATSNLRSWTP
ATPTPVDEAL EAVCARIAAD GDRNGPLLAT YVRKYHHDMW LHFQTVTPLV KRGGQVSYIV
GNSTFYGHGV PAQDWYALML RELGYADVEV EVIRKRNSNK ALFEFDVRAR RP
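
The gene and protein sequences above are printed in blocks of ten characters, so they need to be stripped of whitespace before any analysis. The sketch below shows one way to do that, compute GC content, and translate the coding sequence. It is an illustration only: the helper names (`clean`, `gc_fraction`, `translate`) are hypothetical, and it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the two differ in which codons may act as start codons). The first codon is translated literally, which is sufficient here because this gene starts with ATG.

```python
BASES = "TCAG"
# Amino acids for all 64 codons in TCAG order (standard code; table 11
# uses the same codon-to-amino-acid assignments).
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def clean(seq: str) -> str:
    """Remove spaces and line breaks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a cleaned nucleotide sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate a CDS codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        residue = CODON_TABLE[seq[i:i + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First three ten-base blocks of the gene sequence in this record.
    first_blocks = "ATGCCATTGC GGCATGGTGA TCGTTTGGGT"
    print(translate(first_blocks))  # MPLRHGDRLG
```

Pasting the full gene sequence into `translate` and `gc_fraction` would allow a comparison against the Protein sequence and GC content fields of this record.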