Gene Francci3_3554 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3554 
Symbol 
ID3904493 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4250360 
End bp4251763 
Gene Length1404 bp 
Protein Length467 aa 
Translation table11 
GC content71% 
IMG OID637880875 
Productprocessing peptidase 
Protein accessionYP_482635 
Protein GI86742235 
COG category[R] General function prediction only 
COG ID[COG0612] Predicted Zn-dependent peptidases 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.106365 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.417448 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTCCGACA CTCCCACGCC CGACCTGACC TGCTCAGCGG GGACCGCCGG CGCGCCGCGG 
AACAAGCCGG TTCCCGGCGC CCGGGCCGCG CACCTGCTCG CTGCTGGCGG AGTCAGTGGC
GAATCGTTGC TGGACGGCAC GGCGCGCCGC ACCGTGCTCC CCGGCGGTCT GCGGGTCATC
ACCGAGCGGG TCCCCGGGGT GCGCTCGGTC GCGATCGGCG TCTGGGTCGC GGTGGGCTCG
CGCGACGAGA CGCCGGTGAC CGCCGGATGC TCCCACTACC TGGAGCACCT GCTGTTCAAG
GGCACGCCCA GCCGGGACGC GTTGACTATC AGCGCGTCGG TCGAGGCGGT CGGCGGGGAC
ATCAACGCCT TCACCGGCAA GGAGTACACC TGCTACTACG TGCGGGTGCT CGACTCCGAT
CTCGCGATGG CCGTCAACGT CATCGCCGAC ATGGTCACCA ACTCGCTGGT GACAGCCGAC
GACGTCGAGG CCGAGCGTGG CGTGATCCTG GAGGAGATCG CCATGTACGA GGACGACCCG
GGTGACCTCG TCCATGACGT ATTCGCGGCG GCGATGCTCG GGTCGTCGGT ACTCGGCCGC
CCGGTGCTCG GGACGACCGA GTCGATCGAG GGGCTGGGCC GGGAGACCAT CGCGGACTAC
TACCGCTCCC GGTACGTCCC GCCGGCGATG GTGGTCTCCA TCGCCGGCAA CCTGGCGCAC
GACCGGGCGC TGGCGCTGGT CGCCGAGGCG TTCGCCGACC GTCTGACGGT GAGCGCCGAG
CCGTTCGAGG TGCGTGGCGG ATCGTACGAC TACCCGCCGC CGCCCGGCAT CGTCGTGACC
GACCGGCCGA CCGAGCAGGC GCATCTCGTG CTCGGCACCA GGGGACTGTC CCGGCACGAT
CCGCGCCGTT ACACGCTCGG CGTGCTGTCG ACGGCGCTGG GCGGTGGGAT GAGCTCGCGG
CTGTTCCAGG AGATCCGGGA GAAGCGGGGG CTGGCGTACT CGGTGGGCTC GTTCGCCTCC
CACTTCGCCG ACGCGGGCCT GTTCGGCGTC TATGCCGGAT GCGCCCCCAA GCGTGCCGAC
GTGGTCCTCG AGCTCGCCCG TGAGCAGGTA CGCCAGATCG CCGAGCACGG GATCAGTGCC
GAGGAACTCG ACCGGGCCCG TGGGCAGAAC CGCGGCTCGA TGATCCTCGG CTTGGAGGAC
ACCGGATCCC GGATGAGCCG CCTCGGCAAG AGCGAGCTCG TCCACGGTGA GGTCCTGTCG
GTCGACGAGA TCATCGCCCG GGTGGACGCG GTCACCCTCG ACGATGTCAC CGCGATCGCG
CGGGAGCTGC TGGACCAGTC CTGGGCCCTG GGCGTCATCG GACCCTTCGA CGACCACGAC
TTCAGCGCCG CCGTCGCCCG TTAG
 
Protein sequence
MSDTPTPDLT CSAGTAGAPR NKPVPGARAA HLLAAGGVSG ESLLDGTARR TVLPGGLRVI 
TERVPGVRSV AIGVWVAVGS RDETPVTAGC SHYLEHLLFK GTPSRDALTI SASVEAVGGD
INAFTGKEYT CYYVRVLDSD LAMAVNVIAD MVTNSLVTAD DVEAERGVIL EEIAMYEDDP
GDLVHDVFAA AMLGSSVLGR PVLGTTESIE GLGRETIADY YRSRYVPPAM VVSIAGNLAH
DRALALVAEA FADRLTVSAE PFEVRGGSYD YPPPPGIVVT DRPTEQAHLV LGTRGLSRHD
PRRYTLGVLS TALGGGMSSR LFQEIREKRG LAYSVGSFAS HFADAGLFGV YAGCAPKRAD
VVLELAREQV RQIAEHGISA EELDRARGQN RGSMILGLED TGSRMSRLGK SELVHGEVLS
VDEIIARVDA VTLDDVTAIA RELLDQSWAL GVIGPFDDHD FSAAVAR