Gene Franean1_0300 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0300 
Symbol 
ID5668724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp354886 
End bp356019 
Gene Length1134 bp 
Protein Length377 aa 
Translation table11 
GC content73% 
IMG OID641239230 
Producthypothetical protein 
Protein accessionYP_001504672 
Protein GI158312164 
COG category[S] Function unknown 
COG ID[COG5282] Uncharacterized conserved protein 
TIGRFAM ID[TIGR03624] putative hydrolase 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.262799 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.348803 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACGCTG AGCTGGTGGA CTGGGAACTC GCCGTAACGA CGGCGAAGAA GCTCGTCCGA 
CCAGGGCCGC AGCGAAGCCG GGCGGAGGCG GACGAGATCG TCTCGGAGCT CCGGCGCCTG
GCGGTCGTCG CCGAGGGCCA TGTGCAGGAC TACACCCAGC TCGTCCCCGC CGGACCACCG
ACCCCGATCG CGGTCGTCGA CCGGCCGGAG TGGGTTCGTT CCAACGTCGC CGGGCTGCGC
GTGGCCACCA TGCCCCTGAT CGAGAAGCTC TCCGACCAGA GCCGCGGCCG GCTCGCCGCC
GCGGTGGGCC GGCGGGTCAC CGGTGTCCAG GTCGGGTCCG CGCTCGCCTA CCTCGCGGGC
AAGGTCCTCG GCCAGTTCGA GGTCTTCCTC CCGCCGGAGG AGTACGAGGC GGGCAGCGCC
GCGAGCGCCC CGTCTCTGGC GAAGCCCGGC GCTCCGACCC CGGTGGGGCG GCTCAGCCTC
GTCGCGCCGA ACATCGCCCA CGCCGAGGAG ACCCTGCGGG TGGTCCCACG CGACTTCCGG
CTCTGGGTCT GCCTGCACGA GCAGACGCAC CGCAGCCAGT TCACCGCCGT CCCGTGGCTG
CGCGAGCACC TCGAGTCCGA GATCGCGGCG TTCATCGGCG CGACCGACCT CGATCCCGAT
GTCCTCGCCG ACCGGCTCCG CTCCGCCGTC ACGGCGCTGC GCAGCGCCGT GCGCGACCAC
GGGCCGGACA CGCCGAGCGT CGTGGAGGCG TTGCAGACCC CGGCGCAACG CGCCGTCCTC
GACCGCCTCC AGGCGCTGAT GAGCCTGCTC GAGGGGCACG CCGACCAGGT CATGGACGCG
GTCGGCCCGC AGGTCGTGCC GACGGTGGCC GACATCCGCG GCAAGTTCGA CAACCGGCGC
TCCGGCGGCT CGCCCATCGA CCGCTTCCTA CGTCGCCTGC TCGGGCTGGA TCTCAAGATG
CAGCAGTACC GCCAGGGCGG GGCGTTCGTC CGCGCCGTGG TCGCCGAGGT CGGCGTGGAG
GGCTTCAACC ACGTCTGGCA GTCGCCGCGG ACCCTGCCCA CCCGCCCTGA GCTGACCGAC
CCGGGCGCGT GGATGCTCCG GGTGCTCGGC ACCCGCCCGT CGATGTCCGC GTGA
 
Protein sequence
MDAELVDWEL AVTTAKKLVR PGPQRSRAEA DEIVSELRRL AVVAEGHVQD YTQLVPAGPP 
TPIAVVDRPE WVRSNVAGLR VATMPLIEKL SDQSRGRLAA AVGRRVTGVQ VGSALAYLAG
KVLGQFEVFL PPEEYEAGSA ASAPSLAKPG APTPVGRLSL VAPNIAHAEE TLRVVPRDFR
LWVCLHEQTH RSQFTAVPWL REHLESEIAA FIGATDLDPD VLADRLRSAV TALRSAVRDH
GPDTPSVVEA LQTPAQRAVL DRLQALMSLL EGHADQVMDA VGPQVVPTVA DIRGKFDNRR
SGGSPIDRFL RRLLGLDLKM QQYRQGGAFV RAVVAEVGVE GFNHVWQSPR TLPTRPELTD
PGAWMLRVLG TRPSMSA