Gene Franean1_0858 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0858 
Symbol 
ID5669274 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1004665 
End bp1005849 
Gene Length1185 bp 
Protein Length394 aa 
Translation table11 
GC content73% 
IMG OID641239787 
Producthypothetical protein 
Protein accessionYP_001505222 
Protein GI158312714 
COG category[S] Function unknown 
COG ID[COG3503] Predicted membrane protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value0.581256 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.122318 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGGCG CGAACCCTGC AGGAACCGGA CGGATAACCG GGGTCGACAT CGCTCGCGGC 
GTGGCGCTGC TCGGCATGGT GGCGACCCAC GTCTATCCCC CGTTCACCGA CACCGCGTCG
GACGACCCGG CGGTCTCGCC CGCGTTCATC CTGGCCGCGG GGCGGGCCGC TGCCGCGTTC
GCCGTACTCG CCGGTGTGGC TCTGGTGTTG TCCACCCGAC GGCAGAGCGC GGGCCAGGCC
CGGCTCTCGG TGTTCCTGCG GGCGCTCGGC ATCGGGGCAC TCGGCCTGGG CCTCGCCTAT
GCCGACTCTG GTATCGCCGT GATCCTGGTC TACTACGCGC TGTTGTTCGT CCTGGCGCTG
CCGCTGCTGC GGGCATCCGT GCCGGTACTG ATGACGGTCG CCGTCCTGGC GATCTTCGCG
GAGCCCGTCG TCAGCCAGTT CGTGCGCGGT GACCTGCCCG AGTCGGACCT GTCGTCGCCC
ACGTTCGCCG CGCTGGGCGA GCCGGGCCAT CTCCTCGCGA AGCTGGCGAT CACCGGGGTC
TACCCCGCGT TCGCCTGGCT GGGCTACATC TGCGTCGGGA TGGCCGTCGC CCACGCGGAC
CTGCGCTCAC GCCGGGTGGC GACCCGGCTG CTCGTCGGCG GCCTGGCGCT CGCCCTTGCG
GCGGCGGCCG CGTCCTGGCT GCTGTTGGAG CCGCTCGGCG GGCGCGCCGA GCTCGCCAAC
CCGGCCGAGG TTCCCGGTGT GGGCTCACTG CCGCAGGGCT GGTTCATCGA TTCCGGGCTG
TACGGCGCGA CGCCCACCGA CAGCGCCTGG TGGCTGGCCG TCGACACGCC GCACTCGACG
ACTCCGTTCG ACCTGGCGCA CACCACCGGG ACGGCACTGG CGCTACTCGG CCTTGCCCTG
CTGGTGGCGC GGGTACCGCT GGTCCGGCCA CTCGCGGCCG TCGGCGCGAT GACGCTCACC
TTCTACTCGC TGCACGTGGT GGTGATGGCC ACCGGCGTGC TGCCCACCGA TCCGACCAGG
TCGTATGTGC TCCAGGTGGT CGTCGCGCTC GCGGCCGCGA CGCTCTGGCA CATGACCGGC
CGGCGCGGCC CGGCCGAGGC CGCCGTCTCG GTGCTCCCCC GGGCGGCCCG GCTCGTCCAG
GCCCCGCGGC GACCCGCGGT CGGGCTCGAG CGAGGAACTG GCTAG
 
Protein sequence
MDGANPAGTG RITGVDIARG VALLGMVATH VYPPFTDTAS DDPAVSPAFI LAAGRAAAAF 
AVLAGVALVL STRRQSAGQA RLSVFLRALG IGALGLGLAY ADSGIAVILV YYALLFVLAL
PLLRASVPVL MTVAVLAIFA EPVVSQFVRG DLPESDLSSP TFAALGEPGH LLAKLAITGV
YPAFAWLGYI CVGMAVAHAD LRSRRVATRL LVGGLALALA AAAASWLLLE PLGGRAELAN
PAEVPGVGSL PQGWFIDSGL YGATPTDSAW WLAVDTPHST TPFDLAHTTG TALALLGLAL
LVARVPLVRP LAAVGAMTLT FYSLHVVVMA TGVLPTDPTR SYVLQVVVAL AAATLWHMTG
RRGPAEAAVS VLPRAARLVQ APRRPAVGLE RGTG