Gene Francci3_0069 details

Gene Information

Locus tag: Francci3_0069
Symbol:
ID: 3905404
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 86820
End bp: 88559
Gene Length: 1740 bp
Protein Length: 579 aa
Translation table: 11
GC content: 70%
IMG OID: 637877399
Product: peptidase S1 and S6, chymotrypsin/Hap
Protein accession: YP_479192
Protein GI: 86738792
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain
TIGRFAM ID:
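
The start, end, gene length, and protein length fields above are related by simple arithmetic. The following is a minimal sketch of that check, assuming the coordinates are 1-based and inclusive and that the final codon of the CDS is a stop codon. The function names are hypothetical and are not part of the source record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the terminal stop codon


length = gene_length_bp(86820, 88559)
print(length)                     # 1740
print(protein_length_aa(length))  # 579
```

Both values agree with the Gene Length and Protein Length fields reported in the record.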


Plasmid Coverage information

Num covering plasmid clones: 31
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 18
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACACCGC GTCTCCCCTT CAGCAGTAGC GCCGAGGCCG CCCCCGGGGG CGGCGGCAGG 
GAGGGGTCCG GCGCCGGGCA TGACTCCCCC TCCCCCTGGG CTCCGCCCAT CGGCAGCGCG
GCCGGGACAC CAGCCGGCCA AGCTCACCCG GCTGGTCCCG CCAGCTCCGA ACCCGACCTC
CCGGCGGGCC CGCCTTCCCG TCCGCCCGGC TCGCCGTACA GCCAACCGCC GGCGCTGGGG
CCATCCTCCG GCCAGCATCC TCAGTATCCG AATCCCCATC CCCCGCATGG GGATCCTCGT
CCGTATCCGG ATTCTCCGTA CCCGGCCGGG CCGTCCCAGG CCAGCCAGCA GCAGGCCGCG
TTCTCCGCGG GAAGCCCACC TCCGGCGGCG CCGCCCGGCG GCCCGTGGGG ACCACCTTCA
GGCCCGCCAC CCAGTGGTCC CGCTGCCCCG AGAGCTCTGA ACGGCGCGGC CGGCCCCGCG
GGCCCCCCCG CGGGACCCCC GACCGGAGGG CATTCCTGGG GACCGCCGTG GAGTTCCCCG
AGCGGCGCCG GCTCGCCACC CGCCCAGGAC CTCTACGGCA ACGGGACAAC CATGGCCGCC
TCGCCGCCTT GGCGCCGACG GCGGCTCGTC GCCGCCGGAC TCGCCATTGC CCTGGTCTCA
GCTGGTGTAG GTGGCGGTGT CGGAGCGCTG GTCGCCGATA ACAACGGGGG GCAGACCATC
GTCACATCCG CCGGTCTGCA CAACACCGTG GACAGTTCCG GTGGAACGTC GCCGGCAGCC
GCGAACACCG TGTCGGCCGC GGCGCAGAAG ATCCTGCCGA GTGTCGTGAC AATCTCCGAG
GAATCGAGCA GCGAGTCGGG CACCGGCTCC GGCACCATCA TCCGTTCGGA CGGGCATATC
CTGACGAACA ACCATGTGGT CTCGGGTGCC GCGAACGGCG GTTCGCTGAC GGTCACCCTG
CAGGACGGTC GTACCTTCGA TGCGCAGGTC GTAGGCACGG ATCCGAGCTC GGACCTCGCG
ATGATCAAAA TCAATGCCAC CGGTCTCACT GCGGCCACGT TCGGCAATTC CGACACGCTG
AACATCGGGG AACTGGTGGT AGCGGTCGGC AGTCCGCTCG GGCTGAACGG TACGGTCACG
TCCGGCATCG TCAGTGCCGT GCATCGCCCG GTGCGCACCG GGGATTCAAC CGTGCGGGAT
CAGCAGAACA CCGTGCTCGA CGCAATCCAG ACCGACGCAT CGATCAACCC CGGTAACTCC
GGTGGTCCGC TGGTCAACAG TCGCGGCGAG ATCATCGGCG TGAACAGCGC GATCGCGACC
GTGGGTGGTG GAAGTCCCTT CGGTGGCGGC CAGCAGTCCG GCAACATCGG CGTCGGTTTC
GCTATTCCGG GCAACTATGC CGAGTCGGTG GCCACCCAGT TGATCTCCAC GGGAAGCGCC
CGGCACCCGT ACCTGGGCGT CAGCGCCTCC ACCGCGGAGG AGAACACCCG CTCCACCGCC
TCCAGCGGCA ACGGTGCACA GATCCGCTCC ATGGTTCCGG GTGGACCCGC CGAAAGGGCC
GGCCTTCGCA CCGGCGACGT CATCACGAAG GTCGGCAACC GTGCCGTCAA CGACGTCGAC
TCCCTCATCG CGGCGGTCCG GTCCCACGCC ATCGGCGACG AGGTGGAAGT GACCTATACC
CGTGATGGAC AGAGCGGCAC CGTCAAGGCC CGGCTCGCCC AACAACCACC GGCATCCTGA
 
Protein sequence
MTPRLPFSSS AEAAPGGGGR EGSGAGHDSP SPWAPPIGSA AGTPAGQAHP AGPASSEPDL 
PAGPPSRPPG SPYSQPPALG PSSGQHPQYP NPHPPHGDPR PYPDSPYPAG PSQASQQQAA
FSAGSPPPAA PPGGPWGPPS GPPPSGPAAP RALNGAAGPA GPPAGPPTGG HSWGPPWSSP
SGAGSPPAQD LYGNGTTMAA SPPWRRRRLV AAGLAIALVS AGVGGGVGAL VADNNGGQTI
VTSAGLHNTV DSSGGTSPAA ANTVSAAAQK ILPSVVTISE ESSSESGTGS GTIIRSDGHI
LTNNHVVSGA ANGGSLTVTL QDGRTFDAQV VGTDPSSDLA MIKINATGLT AATFGNSDTL
NIGELVVAVG SPLGLNGTVT SGIVSAVHRP VRTGDSTVRD QQNTVLDAIQ TDASINPGNS
GGPLVNSRGE IIGVNSAIAT VGGGSPFGGG QQSGNIGVGF AIPGNYAESV ATQLISTGSA
RHPYLGVSAS TAEENTRSTA SSGNGAQIRS MVPGGPAERA GLRTGDVITK VGNRAVNDVD
SLIAAVRSHA IGDEVEVTYT RDGQSGTVKA RLAQQPPAS