Gene Francci3_3945 details


Gene Information

Locus tag: Francci3_3945
Symbol:
ID: 3906904
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4722728
End bp: 4724071
Gene Length: 1344 bp
Protein Length: 447 aa
Translation table: 11
GC content: 75%
IMG OID: 637881272
Product: peptidase S8/S53 subtilisin kexin sedolisin
Protein accession: YP_483024
Protein GI: 86742624
COG category: [O] Posttranslational modification, protein turnover, chaperones
COG ID: [COG1404] Subtilisin-like serine proteases
TIGRFAM ID:
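
The start, end, gene length, and protein length values in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and that the final codon of the CDS is a stop codon that is not counted in the protein length; the function names are hypothetical and are not part of any database API.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Number of residues for a CDS whose last codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Total codons minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values copied from the record above.
start_bp, end_bp = 4722728, 4724071
length = gene_length_bp(start_bp, end_bp)
print(length)                     # 1344
print(protein_length_aa(length))  # 447
```

Both results agree with the Gene Length and Protein Length fields listed in the record.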


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.542591
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCACCACC GCGCCATGCT CGGCCTGCTG GTCGGCCTCG CGCTGCTGGT CGGCCTCGTG 
ATGACGACCG CTCCGGCCGG CGCCGCCGTC CCGCCGTACG CGGCCGGGCC CGGCCCGGCT
TCGATCGCGC CCGCCCGAGG CTCGCAGTGG TACCACGCTG TCCTGGGACT CGCCCAGGCG
CACCGGATCA GCCAGGGCGC GGGCACGGTG GTGGCCGTCA TCGACGGCGG GGTCGACGCC
AGCCAACCCA AGTTGTGGGG TCAGCTGCTT CCCGGTACCG GCATCGGCCC CGGGGCGGCG
CGGGACGGTT GGCGCGACGA TGATCCGAAC GGCCACGGCA CCGCCATGGC GGGCATCATC
GCCGGTCGCA ACGACAACGG CCGGCCCGAG GTCCGCGGCA TCGCCCCGGC CGCCAAGATC
CTGCCCGTCT CGACCGGCGC CGAGGCGAAC TCGGAGGAGG TCGCGATCGG CATCCGCCGG
GCAGTCGACC TGGGTGCCGA CGTCATCAAC CTCTCCCTGG GCTCGACGGG GACGGCGACC
CCGGACGAGG AGAAAGCGGT CGGCTACGCC CTCGCGCACG ACGTGGTGGT GGTCGCCTCG
GCGGGGAACG TCGAGTCCGG CGACACCGCG ATCAACTCCC CCGCCAGTAT CCCGGGGGTG
GTCGCGGTGA CCGGATCGAC GGCCGCTGGT GGTTTCTGGC GGGAATCGGC CCACGGGCCG
CGGGCCGTCA TCGCCGCGCC GGCCCCCGGT ATCCGGGCCC CCGTCCCGAC CCGGGTCTCC
CCGGACGGCC TGGACACCGG GGGCGGCACC TCGAACTCCG CGGCGATCGT CGCGGGCGTT
GTCGCCCTCA TCCGGGCCGC CCAGCCGGAC CTGGACGCAC CCAACGTCAT CGAGCGGCTC
GTCTACACCG CGCGGGATGC GGGCTCTCCT GGCCGCGACG ACGAGTTCGG CTTTGGCATC
GTCGACCCCG TCGCGGCGCT GACCCGGGCC GTGCCGGTGG TCAGGAGCAA TCCGCTGCTG
TCGGCACCGA CACGCTGGGG GGTGGGGGGG CCCGCGCCGG CCGCCGGCCG CATCATGCCC
GGCGGGCAGG CCCGGGGGTC CGCCGGCGAC GCGACGACCC ACGGGGCCGG CGCCACACTC
ACGAACACCG GTCCGATCGG CGCAGCCGGC CCAGACTCAT CGAAACCCTC GCCACTGGTC
TGGACCGCCG GGCTCGGGAT CGCGGCCTCG TTGGGCGTCC TGCTCGGGAT CGTGACACAT
CTGCTGTACG CCTGGCAGCG CGCGACGCGG GCGGGACCGG GCCGCGCGGG CAGCCGCGTC
CGCCCGCGCT CCCCTACCGG GTGA
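
A GC content figure like the one in the Gene Information section can be recomputed from a sequence laid out in space-separated blocks of ten, as above. This is a minimal sketch, assuming GC content is defined as the fraction of G and C bases over all bases; `gc_fraction` is a hypothetical helper name, and the sample string is only the first two blocks of the gene sequence, not the full gene.

```python
def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return gc_count / len(bases)


# First two blocks of the gene sequence, used only as a short illustration.
sample = "ATGCACCACC GCGCCATGCT"
print(f"{gc_fraction(sample):.0%}")
```

Passing the full gene sequence as one string, line breaks included, works the same way because all whitespace is stripped before counting.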
 
Protein sequence
MHHRAMLGLL VGLALLVGLV MTTAPAGAAV PPYAAGPGPA SIAPARGSQW YHAVLGLAQA 
HRISQGAGTV VAVIDGGVDA SQPKLWGQLL PGTGIGPGAA RDGWRDDDPN GHGTAMAGII
AGRNDNGRPE VRGIAPAAKI LPVSTGAEAN SEEVAIGIRR AVDLGADVIN LSLGSTGTAT
PDEEKAVGYA LAHDVVVVAS AGNVESGDTA INSPASIPGV VAVTGSTAAG GFWRESAHGP
RAVIAAPAPG IRAPVPTRVS PDGLDTGGGT SNSAAIVAGV VALIRAAQPD LDAPNVIERL
VYTARDAGSP GRDDEFGFGI VDPVAALTRA VPVVRSNPLL SAPTRWGVGG PAPAAGRIMP
GGQARGSAGD ATTHGAGATL TNTGPIGAAG PDSSKPSPLV WTAGLGIAAS LGVLLGIVTH
LLYAWQRATR AGPGRAGSRV RPRSPTG