Gene Franean1_0866 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0866 
Symbol 
ID5669280 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1012199 
End bp1013230 
Gene Length1032 bp 
Protein Length343 aa 
Translation table11 
GC content74% 
IMG OID641239793 
Productpeptidase S1 and S6 chymotrypsin/Hap 
Protein accessionYP_001505228 
Protein GI158312720 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0862974 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.586989 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGCAG GCGCCACCGC CAACCCGGGC CCCGAGGACC AGATCGCACT CCCCGACGGG 
ACCGCGCCGT CCGATGATCG TGGTGAGGCG GCGCTGGACG CATACTCCCG TGTGGTGACC
CGGGTGGCCG AGCGGGTCGG GCCGAGCGTG GGAAGCCTGC GGGTGCGCAC CTCGCGCGGC
GCCGGAGCCG GTTCCGCGGT GCTCTTCACC GAGGACGGGT TCCTGCTCAC CTCCGCGCAT
GTCGTCGAGG GACGTCCAGG CACCGGCCGG GCCGGCGAGC CGGCTGGCAC GGTGGAGTTC
GTCGACGGGA CCGAGCGTGC CGTGGACCTG GTGGGAGCCG ACCCGCTCTC GGACCTCGCC
GTGCTGCGGG CCCGCGGCTC CACACCGCGC CCGGCCGAGC TCGGCGACGC GGCGATGCTG
CGGGTCGGCC AACTCGTCGT CGCCGTCGGC AATCCGCTGG GACTGACCGG GAGCGTCACC
GCCGGCGTGG TCAGCGCCCT GAACCGCTCG CTGCCGACCC GCTCCGGTTC GGCCGTGCGC
GTCGTGGACG AGGTCATCCA GACGGACGCC GCGCTGAACC CGGGAAACTC CGGCGGGGCA
CTGGTCACCG CCGACGGCCG CGTGGTCGGG GTGAACACCG CCGTCGCCGG CGTCGGACTC
GGCCTGGCCG TCCCCGTGAA CGCCACCACC AGGCGCATCC TCGCGGCACT GATCCGGGAT
GGCCGAGTCC GCCGCGCCTA CCTCGGTGTC GCCGGTGCCC GGGTGCCGCT CCCGCCGGCC
CTGGCCGAGC GGACGGGGCA ACGCCACGGC GTGCGCCTCG CGGAGGTGGT ACAGGGTAGC
CCAGCCGGGC AGGCCGGACT GTTCACCGAC GATCTCGTCC TGTCGATCGC CGGGACGCCT
GTCGCCGGCC CGGGCGATCT CCAGCGACTG CTGACCGAGG ACACCATCGG ACAACCCGTC
GAAATGACAG TCTGGCGTCG CGGTGCCCTC GTGGACGTCA TAGCCGTGCC CCGGGAACTC
GTAACCCCGT AG
 
Protein sequence
MDAGATANPG PEDQIALPDG TAPSDDRGEA ALDAYSRVVT RVAERVGPSV GSLRVRTSRG 
AGAGSAVLFT EDGFLLTSAH VVEGRPGTGR AGEPAGTVEF VDGTERAVDL VGADPLSDLA
VLRARGSTPR PAELGDAAML RVGQLVVAVG NPLGLTGSVT AGVVSALNRS LPTRSGSAVR
VVDEVIQTDA ALNPGNSGGA LVTADGRVVG VNTAVAGVGL GLAVPVNATT RRILAALIRD
GRVRRAYLGV AGARVPLPPA LAERTGQRHG VRLAEVVQGS PAGQAGLFTD DLVLSIAGTP
VAGPGDLQRL LTEDTIGQPV EMTVWRRGAL VDVIAVPREL VTP