Gene Francci3_3589 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3589 
Symbol 
ID3904143 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4288524 
End bp4289582 
Gene Length1059 bp 
Protein Length352 aa 
Translation table11 
GC content68% 
IMG OID637880910 
Productsignal peptidase I 
Protein accessionYP_482670 
Protein GI86742270 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0681] Signal peptidase I 
TIGRFAM ID[TIGR02227] signal peptidase I, bacterial type 


Plasmid Coverage information

Num covering plasmid clones31 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones18 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATGCAT CCGATCCCAG TGGGCCTCGG GTACCGGGAA GAACCGATGG CGGTCAGCCG 
GACGGTGCGT GGCCGAGCCC TGCGGCCGAG CCTGACGGCC CGAAGGACCA GGCACAGCCC
GACGGTCCGG AAGCCACCGG TCGTCGGGTT TCCCGGCCCG AGGCGGCGCG GACCCGTGGC
CGTGGCTCGT TCCTCCGCGA GCTCCCGGTC CTTGTGCTGA TCGCCTTTCT GCTCGCCCTG
CTCATCAAGG CTTTTCTGGT CCAGGCGTTC TGGATCCCCT CGGAGTCGAT GGAGCGGACG
CTGCTGGTTG ACGACCGGGT TCTCGTCAAC AAGGTCGTCT ACCACTTCCG CGACGTGCAC
CGGGGTGAGA TCGTCGTCTT CAATGGCAAG GGAACCGGAT TCGATCATGC CGAGTCCGTC
GTCCCGCCGC CGAGCAACGC GTTCAGCAGG TTCGTTCGTG GCGCGCAGAA CCTGTTGGGT
CTCGGGGCCC CGAGCGAGAC CGACTTCATC AAACGCGTCA TAGCGGTCGG CGGCGACACG
GTCGCGTGTT GCGATACCGC GGGCAGAGTC TCGGTCAACG GCCATCCGCT CGACGAGCCG
TACGTCTACC AGAACGACTA TCAGCGGTTC GGTCCGCTGA CCGTCCCGGC TGGCTACCTG
TGGGTGATGG GTGACCATCG CGGGGCCTCC TCGGATGCCC GGCAGAACGG ACCGATCCCG
AAGCATGCGG TGGTGGGACG GGCCTTCGTC CGTGTCTGGC CACTTGGCCG GTTCGGATTC
CTGGGCGTTC CGAACGATTT CGCCGGGATT CCCGCCGCGT CGGTGCTGCC TCCGGCTCCT
CGTGCGTCGG CCGGGTCGAC CACATCCGCC GCGCTGTCTT CTACCGGCGA CGCCGTGGTG
CCGCTGCCGG GCTACCCGGC CACGGATGAC GTATCCCTGT TCGCCCTTGC CGTGCTGATC
CCTCCGGCGT GGACCGCCAC CCGTCGGCCT CGCCGGATGC CACGAGGATC CACGCGCACG
CGCGCACCAG GGACGGCTCG CCGCCGCGGT CCGTCATGA
 
Protein sequence
MNASDPSGPR VPGRTDGGQP DGAWPSPAAE PDGPKDQAQP DGPEATGRRV SRPEAARTRG 
RGSFLRELPV LVLIAFLLAL LIKAFLVQAF WIPSESMERT LLVDDRVLVN KVVYHFRDVH
RGEIVVFNGK GTGFDHAESV VPPPSNAFSR FVRGAQNLLG LGAPSETDFI KRVIAVGGDT
VACCDTAGRV SVNGHPLDEP YVYQNDYQRF GPLTVPAGYL WVMGDHRGAS SDARQNGPIP
KHAVVGRAFV RVWPLGRFGF LGVPNDFAGI PAASVLPPAP RASAGSTTSA ALSSTGDAVV
PLPGYPATDD VSLFALAVLI PPAWTATRRP RRMPRGSTRT RAPGTARRRG PS