Gene Francci3_0254 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_0254 
Symbol 
ID3903662 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp293551 
End bp294774 
Gene Length1224 bp 
Protein Length407 aa 
Translation table11 
GC content74% 
IMG OID637877582 
Productpeptidase M48, Ste24p 
Protein accessionYP_479371 
Protein GI86738971 
COG category[O] Posttranslational modification, protein turnover, chaperones 
COG ID[COG0501] Zn-dependent protease with chaperone function 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones27 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGACGACCG CGGCACTTCT CGCCCTGTTC GCGGGCGTGT TGGCCTGGCC GGTCCCACGC 
CTGCTGGCCG CCGCGCGCTG GCCCTACCGA TGTCCACGGG CGGCGATCGT GCTCTGGCAG
GCGGTCGGTC TGGCCGGCGG AGTGTCGGCG CTACTCGCGG CGATCGCGTT CACCGTCTCA
CCGCTGTCCG GCGACACCCC GACCGCCATC GTCGACCATC TCGACAACAT CGCGGCCGGC
GCGCCGCTGA CGGGCCTGGG CCTGATCAAC CTCATCGGCC TCGCTGTGGC GGCGGCCTTG
GCCTCCCGGC TGTTCGGGGT GCTCGGCACC TCCAGCGCCG CCACCCTTCG CGAGCGTCAC
CGACACCGCG ACCTGGTCGA TCTGGCTGGC CGTCGGCATC GCCGGCATGA GACCTGCGTC
CCGCCCGGCC ATGCGTCCCA TCATTCCGGC ACAGCCGCGG ACTCTGGTGC GGCCACCGAC
CCTGGCACAA CCGCCGACGG CGGGGGCGCA ACCGGCTGCT GCCTGTGCGA ACACCGGGAC
CGGGCCGGCA TTCTACTGCG TATCCTCGAT CATCCCGTCG CGGTCGCCTA CTGCGTGCCG
GGGGTGCGGC ACGCCCGGGT TGTGGTCTCC CGCGGGCTAC TGAACACCCT CGACGCCACC
GAACTGGACG CCGTGCTCGC CCACGAGGCG GCCCACGTCG CCGGTCGTCA CGACCTCGTC
ATCCAGCCGT TCGTCGCGTG GGAGCGGACC TTCCCGTTCC TGGACCCGGC CAGGGAGGCG
ACCGCGGCCG TCTCGCTCCT CGTCGAGATG CTCGCCGACG ACGCGGCGGC CCGGCAGACA
AGCCGCCGAT CGCTCGCGCG GGCCTTGGCC CGGCTGGGCG GGGCAAGGGC ACCGGTCCCC
GCCGGGGCGC TGGGGATCAT CGATTCTCCG GCCGCTGTGC TGCCCGCACT CGGCTCCCAC
GAGCCTACGG CGGTCGGCGT CCGCGATTCC GGCCCGCCCG GCCCGGGCAA CCCAGACGCC
GGCGGGCCAG ACCCCGGCGA TGCGGCGCCC GGCCAGACAC GACCGGTCGG CGGCGGCCCC
CGGCGCGGCA CGGCCAATCC GGTCATCTCG CGGATCACCC GGCTCCTCGA CCCGCCGGAA
GTGCCATGGT GGATGCCGCC GTGCGCCTAC TTCGGGGCGA TCATCGTGCT CGCCGCGCCC
CCGCTTATTC TCCTAGTCGG CTGA
 
Protein sequence
MTTAALLALF AGVLAWPVPR LLAAARWPYR CPRAAIVLWQ AVGLAGGVSA LLAAIAFTVS 
PLSGDTPTAI VDHLDNIAAG APLTGLGLIN LIGLAVAAAL ASRLFGVLGT SSAATLRERH
RHRDLVDLAG RRHRRHETCV PPGHASHHSG TAADSGAATD PGTTADGGGA TGCCLCEHRD
RAGILLRILD HPVAVAYCVP GVRHARVVVS RGLLNTLDAT ELDAVLAHEA AHVAGRHDLV
IQPFVAWERT FPFLDPAREA TAAVSLLVEM LADDAAARQT SRRSLARALA RLGGARAPVP
AGALGIIDSP AAVLPALGSH EPTAVGVRDS GPPGPGNPDA GGPDPGDAAP GQTRPVGGGP
RRGTANPVIS RITRLLDPPE VPWWMPPCAY FGAIIVLAAP PLILLVG