Gene Franean1_4885 details


Gene Information

Locus tag: Franean1_4885
Symbol:
ID: 5673225
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 5860221
End bp: 5861414
Gene Length: 1194 bp
Protein Length: 397 aa
Translation table: 11
GC content: 74%
IMG OID: 641243740
Product: peptidase M50
Protein accession: YP_001509156
Protein GI: 158316648
COG category: [R] General function prediction only
COG ID: [COG1994] Zn-dependent proteases
TIGRFAM ID:
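
The coordinate and length fields above can be cross-checked with a short calculation. The sketch below is illustrative only. It assumes that the coordinates are 1-based and inclusive and that the gene length counts one terminal stop codon; the record does not state either point explicitly. The variable names are hypothetical.

```python
# Values copied from the record above.
start_bp = 5860221
end_bp = 5861414
gene_length_bp = 1194
protein_length_aa = 397

# Assumption: coordinates are 1-based and inclusive.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS length includes one terminal stop codon,
# so the protein has one residue fewer than the number of codons.
codons = gene_length_bp // 3
coding_codons = codons - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("codons minus stop matches protein length:", coding_codons == protein_length_aa)
```

Under these assumptions both comparisons hold for this record.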


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Fosmid unclonability p-value: 0.333233
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGGACC AGCCCGGCGC GGCGCGGCAC GGCGTCCCCG GTGGAGCGGA GCCGCCGGAG 
CGGCGGCCGG CGCCGCGTCC GCCGGGTATG CGGGTCGGGC ACGTCCGCGG TGCCCCGATC
TTCGTCTCAC CGCTGGCCGT GGTCTTCGCG GCGCTGGTCG CGTCCCTGCT CATCGACCCG
ATCCGGGACC ATCTGACGAT CGCCGCGACC GACGGGCATG TGCTCGTACT GTCCCTGCTC
GTCTCGGCGG GGTTCCTCCT GTCGCTGCTC GCGCACGAGA TCGGGCACGC GCTGACCGCG
CAGCGGTTCG GGCTGCACGT CCGGTCGGTG ACCCTGCACG GCTTCGCCGG TTTCACCGAG
TTCGAACCCG AGCCGGCCAC CGCCGCCCGC GAGTTCCTGA TCGCCTTCGT CGGGCCCGCC
GTCAACGGCG TGATCGGGGG GCTGTGCTTC CTGGGCCTGC TGGCCGTGCC GGATCACGGC
AGCGTCGGGA CAGTGCTCTT CTTCATCGGC TTCACCAACG CCGCGCTGTT CGTGTTCAAC
CTGGCGCCGG GCCTGCCTCT CGACGGCGGG CGGGTCGTCG TCGCGGCGGT GTGGGCCATC
GGGCACGACA AGCTGCGTGG GCTGCGGGCC GGCGCCTACG GCGGCTTCAT CGTGGCTGGT
GCGCTGGTGG TCTGGGGCGC GACGAAGGAC TCCGACGGCT TCGGTGCGCT CTACGCCTAC
GTCCTGGCCG GGTTCGTCGC CTTCGGCGCC TACCAGTCGC TGCGGTCGGC GAAGGTCCGC
GAGCGGCTGC CCGGGCTGTC GGCGGGCCGG CTCGCCCGCC GCACGCTGCC GGTGGAGGCC
GCGGTCCCGC TCGGCGAGGC GCTGCGCCGG GCCCAGGAGG TGGGCGCCAC GGCCGTCGCG
GTGATCGACC GGGACGGTAC CCCCATCAAG ATCATGAATG GCGCCTCCGT CGACGCCCTG
CCGGAGCACC GCCGGCCGTG GATGGCGGTG GACGAGGTCA GCCGGAGCAT CGGCCCGGGG
ATGACCCTGC AGGCCGAGCT CGAGGGCGAG GACCTGCTGG CGGCCGTCCA GCGGGCCCCG
GCCGCGGAGT ACCTCGTGGT CCAGCGCGGC CGGCCGGTCG GCGTGCTGGC CATGGTCGAT
CTCGTGGCCA GGATCGACCC GGCGGCCGCG ACCCGTATGG TCTCCCATCG GTGA
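
The gene sequence is printed in space-separated groups of ten bases, so it has to be joined before its length or GC content can be computed. The following is a minimal sketch of that step, not the method the database itself uses; `clean_sequence` and `gc_fraction` are hypothetical helper names, and only the first row of the sequence is used as sample input.

```python
def clean_sequence(text: str) -> str:
    """Remove spaces and line breaks from a grouped sequence and upper-case it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of bases that are G or C (0.0 for an empty sequence)."""
    if not seq:
        return 0.0
    return sum(base in "GC" for base in seq) / len(seq)


# First row of the gene sequence above; paste all rows to analyse the full gene.
first_row = "ATGGAGGACC AGCCCGGCGC GGCGCGGCAC GGCGTCCCCG GTGGAGCGGA GCCGCCGGAG"
seq = clean_sequence(first_row)
print(len(seq), "bases, GC fraction", round(gc_fraction(seq), 2))
```

Running the same two functions on all rows gives a value that can be compared with the Gene Length and GC content fields.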
 
Protein sequence
MEDQPGAARH GVPGGAEPPE RRPAPRPPGM RVGHVRGAPI FVSPLAVVFA ALVASLLIDP 
IRDHLTIAAT DGHVLVLSLL VSAGFLLSLL AHEIGHALTA QRFGLHVRSV TLHGFAGFTE
FEPEPATAAR EFLIAFVGPA VNGVIGGLCF LGLLAVPDHG SVGTVLFFIG FTNAALFVFN
LAPGLPLDGG RVVVAAVWAI GHDKLRGLRA GAYGGFIVAG ALVVWGATKD SDGFGALYAY
VLAGFVAFGA YQSLRSAKVR ERLPGLSAGR LARRTLPVEA AVPLGEALRR AQEVGATAVA
VIDRDGTPIK IMNGASVDAL PEHRRPWMAV DEVSRSIGPG MTLQAELEGE DLLAAVQRAP
AAEYLVVQRG RPVGVLAMVD LVARIDPAAA TRMVSHR
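
The protein sequence can be checked against the gene sequence by translating codons. The sketch below builds the codon table from the standard amino-acid ordering, which NCBI translation table 11 shares with the standard code for internal codons. Table 11 also permits alternative start codons, which this sketch does not handle; that is not an issue here because the gene starts with ATG. The name `translate` is hypothetical and the code is an illustration, not the pipeline that produced the record.

```python
from itertools import product

# Codons enumerated in T, C, A, G order, paired with the standard
# amino-acid string used by NCBI for translation tables 1 and 11.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene and its last 9 bases, copied from the record.
print(translate("ATGGAGGACC AGCCCGGCGC GGCGCGGCAC"))
print(translate("CATCGG TGA"))
```

The first call corresponds to the first ten residues of the protein sequence and the second to its final two residues followed by the stop codon.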