Gene Franean1_0931 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0931 
Symbol 
ID5669345 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1087176 
End bp1088462 
Gene Length1287 bp 
Protein Length428 aa 
Translation table11 
GC content73% 
IMG OID641239858 
Productpeptidase M16 domain-containing protein 
Protein accessionYP_001505293 
Protein GI158312785 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.557944 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGCGGCCC CCTCATACCC GGTTCAGCGA GTGACGCTCG ACAACGGCCT GCGGGTGCTG 
CTCGCCCCGG ACCGGACGGC CCCGGTGGTG GCTGTCGCCG TCCACTACGA CGTGGGCTTC
CGTTCGGAGC CGGAGGGGCG GACGGGTTTC GCCCACCTGT TCGAGCACCT GATGTTCCAG
GGCAGCGAGA ACGTGGGCAA GGCGGAGCAC CCGAAGTACG TGCAGGCCGC CGGCGGCATC
TTCAACGGGT CGACACACCC CGACCACACG GACTACTTCG AGCTGCTGCC CTCCGGCGCG
CTGGAGCTGG CACTGTTCCT CGAGGCCGAC CGGATGCGCG CGCCGAGGAT CACCCGGGAG
AACCTGGACA ACCAGATCGC GGTGGTGCAG GAGGAGATCC GGGTCAACGT GCTGAACCGG
CCGTACGGCG GGTTCCCGTG GATCACCCTC CCGCCGGTCG CCTTCGACAC CTTCCCGAAC
GCCCACAACG GCTACGGCGA CTTCTCCGAG CTCGAGGCGG CCAGCATCGA CGACGCCGCT
GACTTCTTCG ACAAGTTCTA CGCGCCGGGC AACGCGGTGC TGACCGTCGT CGGTGAGTTC
GACCCCGACG CGACGCTCGA GCTCGTCCAG CGGTACTTCG GCGCGATCCC GGCCCGCGCG
GTGCCGGCGC GGCGCAGCTT CGCCGAGCCG GTGCGGGCCG AGGCGCGCCG GGAGGCACTC
ACCGACAAGC TGGCCCCGCG CCCCGCGCTC GCCGTCGGCT ACCGGGTGCC CGACCCCGAC
ACCGACCTGC CGGCGTTCCT CGCCACCTAC CTGCTGACGG ACGTCCTCAC CACCGGCGAC
GCGAGCCGGC TGGAGCGGCG CCTCGTCCAG AAGGACCGCT CGGTCACCGC GGTCAGCAGC
TACGTCGGCA CCTTCGGGGA CCCGTTCGAC CAGCGTGACC CGCTGCTGCT GACGCTGGAG
GCCCGGCACG CCGGGGACTC GACCGCCGAC ACGGTGCTCG CGGCCGTCGA CGAGGAGCTC
GACCGGCTGG CCGGTGACGG CCTGGAGCCC GGCGAGCTCG AGCGGGTGCA GGCCCAGGTG
GCCTCGGCGA TCCTGCGCGA GTCCGACGAC GCGCTCGGCC GGGCGCTGGC GATGGCGACG
TTCGAGCTGC ACCGCGGCCG TCCCGAGCTG CTCAACGAGC TGCCCGGGCT GCTGTCCGAG
GTGACCGCCG ACGCGGTCGC CGCGGCCGCG GGCGCGCTGC GTACCCAGGG CCGGTCCGTG
CTGGAGCTGC GGGCGGGAGC CGCCTGA
 
Protein sequence
MAAPSYPVQR VTLDNGLRVL LAPDRTAPVV AVAVHYDVGF RSEPEGRTGF AHLFEHLMFQ 
GSENVGKAEH PKYVQAAGGI FNGSTHPDHT DYFELLPSGA LELALFLEAD RMRAPRITRE
NLDNQIAVVQ EEIRVNVLNR PYGGFPWITL PPVAFDTFPN AHNGYGDFSE LEAASIDDAA
DFFDKFYAPG NAVLTVVGEF DPDATLELVQ RYFGAIPARA VPARRSFAEP VRAEARREAL
TDKLAPRPAL AVGYRVPDPD TDLPAFLATY LLTDVLTTGD ASRLERRLVQ KDRSVTAVSS
YVGTFGDPFD QRDPLLLTLE ARHAGDSTAD TVLAAVDEEL DRLAGDGLEP GELERVQAQV
ASAILRESDD ALGRALAMAT FELHRGRPEL LNELPGLLSE VTADAVAAAA GALRTQGRSV
LELRAGAA