Gene Franean1_1147 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1147 
Symbol 
ID5669560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1369759 
End bp1371324 
Gene Length1566 bp 
Protein Length521 aa 
Translation table11 
GC content72% 
IMG OID641240079 
Productsignal recognition particle protein 
Protein accessionYP_001505507 
Protein GI158312999 
COG category[U] Intracellular trafficking, secretion, and vesicular transport 
COG ID[COG0541] Signal recognition particle GTPase 
TIGRFAM ID[TIGR00959] signal recognition particle protein 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0335878 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGTTCGACA CCCTTTCCAG CCGCCTCGAC AAGGTCTTCA CGTCGCTGCG TGGCAGGGGG 
CGGCTGACCG ACGCCGACAT CGACGCCACC GCCCGTGAGA TCAGGGTGGC GCTGCTGGAG
GCCGACGTCG CGCTGCCGGT CGTCCGCGGT TTCGTCGCCG CCATCCGCGA GCGGGCCCGC
GGGGCCGAGG TGAGCACCTC GCTCAACCCG GCGCAGCAGG TCATCAAGAT CGTCAACGAG
GAGCTCGTCG CCATTCTCGG CGGCGGCACG ACCACGTTGC GCTTCGCCAA GACGCCGCCG
ACCGTGATCC TGCTCGCCGG CCTGCAGGGA ACCGGCAAGA CGACCCTCGC CGGCAAGCTC
GGCCGCTGGC TGCGAGCCCA GGGGCACACG CCGCTGCTCG TCGCCGCCGA CCTGCAGCGC
CCGAACGCGG TGAACCAGCT CCAGGTCGTC GGCCAGCGGG CCGGGGTCGA GGTCTTCGCC
CCGGAGCCCG GCAACGGTGT CGGTGACCCG GTGCGGGTCG CCCGCGACGC GCTCGCCCAC
GCCCGTCGCC ACGTCTTCGA CGTGGTGGTC GTCGACACGG CCGGCCGCCT CGGTGTCGAC
GAGGAGCTGA TGCGCCAGGC CGCCGACATC CGCGACGCCG TCTCGCCGGA CGAGATCCTC
TTCGTCCTCG ACGCGATGAT CGGCCAGGAC GCCGTCTCCA CGGCCCAGGC CTTCGCTGAC
GGGGTCGGCT TCACCGGCGT CGTGCTGACC AAGCTTGACG GTGACGCGCG CGGTGGTGCC
GCGCTGTCGG TGGCCCGGGT GACCGGGGCG CCGATCATGT TCGCCTCCAC CGGCGAGACG
CTCGACGACT TCGACGTCTT CCACCCCGAG CGGATGGCCT CGCGCATCCT CGGTATGGGC
GACGTCCTGA CGCTCATCGA GCAGGCCGAG AAGGCGTTCG AGGTCGAGCA GGCCGAGGCG
ATGGCCGTCA AGATGGCCAA CTCGGAGTTC ACGCTCGAGG ACTTTCTCGA GCAGATGCTC
ACCGTTCGCA AGATGGGCCC GATCGGCAAC CTGCTCGGGA TGCTGCCCGG AATGGGGCAG
ATCAAGGACC AGCTCGCCCA GGTTGACGAC CGCGACCTCG ACCGGGTCGT CGCCATCATC
CGGTCCATGA CGCCGGCCGA GCGGCGGGAC CCGAAGATCC TCCAGGCTTC CCGCAAGGCC
CGGGTGGCGC GCGGGTCCGG CGTGACCGTC ACCGAGGTCA ACCAACTGCT GGACCGCTTC
GGTGAGGCGC GCAAGATGAT GCGGCAGATG GCCGGCGGCG CCGGCCTGCC GGCGGGAATG
GCACGCGCGA AGGCGGCCCA GGCCCGCAAG GCCGCCAAGA AGGGCAAGGG CGCGCGGCGC
AGCGGAAACC CGGCGGCGCG CGCCGCGCAG GCGCAGGACC GCCGCAACGC GCCCCAGGAC
GGCCCGCCGG CGCTCGGCCT TGGGGGCGAC GGCCTGCAGG GCATGCCTGA CCTGTCCTCG
CTCATCCAGC AGGGTGGCTT CGGGGGCGGC GACACCCCGC CGCGGCGTCC CCGCCCAGGC
CGCTGA
 
Protein sequence
MFDTLSSRLD KVFTSLRGRG RLTDADIDAT AREIRVALLE ADVALPVVRG FVAAIRERAR 
GAEVSTSLNP AQQVIKIVNE ELVAILGGGT TTLRFAKTPP TVILLAGLQG TGKTTLAGKL
GRWLRAQGHT PLLVAADLQR PNAVNQLQVV GQRAGVEVFA PEPGNGVGDP VRVARDALAH
ARRHVFDVVV VDTAGRLGVD EELMRQAADI RDAVSPDEIL FVLDAMIGQD AVSTAQAFAD
GVGFTGVVLT KLDGDARGGA ALSVARVTGA PIMFASTGET LDDFDVFHPE RMASRILGMG
DVLTLIEQAE KAFEVEQAEA MAVKMANSEF TLEDFLEQML TVRKMGPIGN LLGMLPGMGQ
IKDQLAQVDD RDLDRVVAII RSMTPAERRD PKILQASRKA RVARGSGVTV TEVNQLLDRF
GEARKMMRQM AGGAGLPAGM ARAKAAQARK AAKKGKGARR SGNPAARAAQ AQDRRNAPQD
GPPALGLGGD GLQGMPDLSS LIQQGGFGGG DTPPRRPRPG R