Gene Franean1_0940 details

Gene Information

Locus tag: Franean1_0940
Symbol:
ID: 5669354
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 1100652
End bp: 1102010
Gene Length: 1359 bp
Protein Length: 452 aa
Translation table: 11
GC content: 75%
IMG OID: 641239867
Product: hypothetical protein
Protein accession: YP_001505302
Protein GI: 158312794
COG category: [S] Function unknown
COG ID: [COG5282] Uncharacterized conserved protein
TIGRFAM ID: [TIGR03624] putative hydrolase
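
The length fields in this record follow from the coordinates. The sketch below recomputes them, assuming the start and end positions are 1-based and inclusive (the record does not state this, but it is the reading that reproduces the listed gene length). The variable names are illustrative only.

```python
# Values copied from the record; coordinates are ASSUMED to be 1-based, inclusive.
start_bp = 1100652
end_bp = 1102010

gene_length = end_bp - start_bp + 1   # both endpoints are counted
codons = gene_length // 3             # a CDS is read in triplets
protein_length = codons - 1           # the last codon is the stop codon

print(gene_length, codons, protein_length)  # 1359 453 452
```

The results agree with the listed gene length of 1359 bp and protein length of 452 aa.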


Plasmid Coverage information

Num covering plasmid clones: 18
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.685586
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGGCG GCGGCTTCCC CTTCGGCTTC GGCCCCACTC CGGGTGGCGA CCCGGAGCGC 
CCCGCTGGCG GGGCCCCGTT CTTCGCCGAG CTCGAACGGC TGCTCTCCTG GCAGGGCGGT
CCGGTCAACT GGGAGCTCGC CCGCCAGGTG GCGGTGCGGA CCCTGGGCGG CGACGACCGC
GCGGTGCGGG CCGCCGAGAC CGGTGAGGTC GAGAAGGCGC TGCGCATCGC CGACGTGTGG
CTCGACCCGA TGACCGCGCT GCCGGCCGGC GCGACCACCG CCGCGGCGTG GTCCCGCGAG
CAGTGGATCG AGGCCACGCT GCCCGTCTGG CGGACGCTGT GCGACCCGGT GGCGGGCAAG
GTCGTCGAGG CCATGCGCAC CGGGATCTCG TCCGGCCTGA GCCAGCTCGG CGGCGGCGAC
CTGCCGCCCG AGCTCGCGGG CGCGCTGCCG CCCGGCGTCG ACCTCGGCCG GCTGATGGGC
GCCGGCGGGC CGGTCGTGCA GATGATGAAC CAGGTCGGCG GCATGCTCTT CGGCGCGCAG
GTCGGCCAGG CCATCGGCAG CCTGGCGGCC GAGGTGGTCA GCTCGACCGA GGTCGGGCTG
CCGCTGGGCC CCGCGGGCAC GGCCGCCCTG CTGCCGGCGA ACGTGGCCGC CTTCGGGCAG
GGTCTGGGCG TCGAGGACGA AGAGGTCCGT ATCTACCTGG CGCTGCGCGA GGCGGCGTCG
AACCGGCTGT ACGCGCACGT CCCCTGGCTG CGGGCGCACG TGCTGGGCGC GGTCGAGGAG
TACGCGCGTG GCATCGCCGT CGACCCGGAG GCGGTCGGCC GCGTGATGCG GATGATCGAC
CCCACCGCGC TGATGAACCC CGAGCGGCTC ACCGAGGCGC TCGGGGAAGA TGTCTTCGCC
GACGCGGACA CCCCCGAGCA GAAGGCGGCA CTGGCCCGCC TGGAGCTGAT CCTGGCGCTG
ATCGAGGGCT GGGTGGACCA CGTCGCGGAC ACCGCCGCGT CCGAGCACCT GCCGTCCGCG
GCGAAGCTGC GCGAGATGGT CCGTCGGCGC CGCGCCGAGG GCGGCCCGGG AGAGCAGATC
TTCTCGACCC TCGTCGGCCT GTCGCTGCGC CCGCGCCGGC TGCGCGAGGC CGCCGCGCTG
TGGGAGGCGT TGCGCGAGGC CCGCGGGCAC GACGGGCGGG ACGCGGTATG GGCGCATCCG
GACCTGTTGC CCGGCGGGGA GGACCTCGCG GACCCGTCCG CGTTCGTCTC CGGCGCCGGA
GCGGGCTCCG ACGTCGACAT CATGGCGGAA ATCGAGAAAC TCGACGACAC GGCGCCCGGA
GAAGACACTC CGCCGGCCTC GGGTGACTCG CCTTCCTGA
 
Protein sequence
MSGGGFPFGF GPTPGGDPER PAGGAPFFAE LERLLSWQGG PVNWELARQV AVRTLGGDDR 
AVRAAETGEV EKALRIADVW LDPMTALPAG ATTAAAWSRE QWIEATLPVW RTLCDPVAGK
VVEAMRTGIS SGLSQLGGGD LPPELAGALP PGVDLGRLMG AGGPVVQMMN QVGGMLFGAQ
VGQAIGSLAA EVVSSTEVGL PLGPAGTAAL LPANVAAFGQ GLGVEDEEVR IYLALREAAS
NRLYAHVPWL RAHVLGAVEE YARGIAVDPE AVGRVMRMID PTALMNPERL TEALGEDVFA
DADTPEQKAA LARLELILAL IEGWVDHVAD TAASEHLPSA AKLREMVRRR RAEGGPGEQI
FSTLVGLSLR PRRLREAAAL WEALREARGH DGRDAVWAHP DLLPGGEDLA DPSAFVSGAG
AGSDVDIMAE IEKLDDTAPG EDTPPASGDS PS
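
The protein sequence is the codon-by-codon translation of the gene sequence. As a minimal sketch, the code below translates the first and last rows of the gene sequence and reproduces the matching ends of the protein sequence. The names `CODON_TABLE` and `translate` are hypothetical helpers written for this illustration, not part of any database tool. The amino-acid assignments are those of the standard genetic code, which translation table 11 shares. Table 11 differs in the start codons it permits, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering (first base varies slowest).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate DNA in frame 1; whitespace is ignored, '*' marks a stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3  # drop any trailing partial codon
    return "".join(CODON_TABLE[seq[i:i + 3]] for i in range(0, usable, 3))


# First and last rows of the gene sequence, copied from the record.
first_row = "ATGAGCGGCG GCGGCTTCCC CTTCGGCTTC GGCCCCACTC CGGGTGGCGA CCCGGAGCGC"
last_row = "GAAGACACTC CGCCGGCCTC GGGTGACTCG CCTTCCTGA"

print(translate(first_row))  # MSGGGFPFGFGPTPGGDPER
print(translate(last_row))   # EDTPPASGDSPS*
```

The last row can be translated on its own because the 1320 bases before it are a multiple of three, so it starts on a codon boundary.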