Gene Franean1_0900 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0900 
Symbol 
ID5669314 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1051978 
End bp1053417 
Gene Length1440 bp 
Protein Length479 aa 
Translation table11 
GC content74% 
IMG OID641239827 
Productstress protein 
Protein accessionYP_001505262 
Protein GI158312754 
COG category[T] Signal transduction mechanisms 
COG ID[COG2310] Uncharacterized proteins involved in stress response, homologs of TerZ and putative cAMP-binding protein CABP1 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value0.204273 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCACAGG TGCTAAGCAA GGGCGCGAAC GCCCCGCTTC CGACGACGGA TGTGCGGGTC 
GAGGTGTCTT CGTCCACCCC GCTCGACATC GCCGCGTTGC TGGTCACCCC GTCCGGCAAG
GTGCGGGGGG ACGCCGACTT CGTCTTCTTC AACCAGCCGG CCGGTCCGGG GGTGCGCCTC
GCGCCGCCGT CGGCGCTGGA GTTCATGCTC ACCGCGGTGC CGCCCGACAT CGACAAGGTC
GTGGTCACAG GCAGCCTGGA CGGCGCCGGC CCGCCGACCT TCGCCGGCGT GCGCGGCCTG
GCCGTGATCG TGCGGGACGC GCGCGGCCAG GAGGTCGTCC GGTTCGACCC GGCCGGGATG
AGCAGCGAGA CGGCCCTCGT GCTGGTCGAG CTGTACCGCC GTGCCGGGAG CTGGAAGGTG
CGTGCCGTCG GCCAGGGCTA CGCCTCCGGC CTGGCCGGGA TCGCCACCGA CTTCGGGATC
ACCGTCGACG ACCCGGGCTC CGGGAACACC GCGGCGGCAC CAGCGTCCGC GGGCCCCGGA
GCCGGAACGG GGGCCCCGCC ACCGTCGCAG TACGACGCCC CGACGCAGGT CGTCTCGCCC
CCGCCCGGTC AGCAGTGGGG TCCGCCGCCT GGCGCGCCGC CGCAGGCTCC GCCGCCGCCG
AACCCGCAGC AGTGGGGCCC GCCGCCGGGC CAGCAGTGGG GTCCCCCGCC GGGCCAGCAG
TGGGGTCCGC CGCCCGGCGC GCCGCCGCAG GCTCCGCCGC CGCCGAACCC GCAGCAGTGG
GGCCCGCCGC CGGGCCAGCA GTGGGGTCCG CCGCCCGCCG GCCCGCCCGG CCCGGGTGCT
CCCGCCGGCG CGGTGCCCGG CCGGGTGAAC CTGGACAAGG GCCGGGTCTC GCTGCGCAAG
GGCCAGAGCG TGTCCCTGGT CAAGACCGGC GCCCCGCCGC TGGTCCGGGT CCGGATGGGT
CTCGGCTGGG ATCCCGCGCA GCAGGGCCGC TCCATCGACC TCGACGCGTC CTGCATCCTG
TTCGACGACC GCGGGAAGGA CGTCGACAAG GTCTGGTTCA TGTCGAAGAA GGGGGCGCGT
GGCGCTGTCC GCCACTCGGG GGACAACCTC ACCGGCCAGG GCGAGGGCGA TGACGAGACC
ATCTTCGTCG ATCTCGGCGC GCTGCCGCAG AACGTCGTCA GCCTGATCTT CACGGTGAAC
AGCTTCCAGG GGCAGTCCTT CACCGACATC CGCAATGCCT TCTGCCGGCT CGTCGACGAC
CAGACCAACC AGGAGCTGGT GCGGTTCGAC CTGTCCGAGT CGAAGCCGGC GACGGGACTG
GTGATGTGCC GTCTCCAGCG GGAGCCGGGG GCGCCAACCT GGGTGATGAC CGCGATCGGC
GAGTTCCACG ACGGGCGTAC CGTGCGCGCG ATGGTCGGGC CGTCCCGCCA GTACCTCTGA
 
Protein sequence
MAQVLSKGAN APLPTTDVRV EVSSSTPLDI AALLVTPSGK VRGDADFVFF NQPAGPGVRL 
APPSALEFML TAVPPDIDKV VVTGSLDGAG PPTFAGVRGL AVIVRDARGQ EVVRFDPAGM
SSETALVLVE LYRRAGSWKV RAVGQGYASG LAGIATDFGI TVDDPGSGNT AAAPASAGPG
AGTGAPPPSQ YDAPTQVVSP PPGQQWGPPP GAPPQAPPPP NPQQWGPPPG QQWGPPPGQQ
WGPPPGAPPQ APPPPNPQQW GPPPGQQWGP PPAGPPGPGA PAGAVPGRVN LDKGRVSLRK
GQSVSLVKTG APPLVRVRMG LGWDPAQQGR SIDLDASCIL FDDRGKDVDK VWFMSKKGAR
GAVRHSGDNL TGQGEGDDET IFVDLGALPQ NVVSLIFTVN SFQGQSFTDI RNAFCRLVDD
QTNQELVRFD LSESKPATGL VMCRLQREPG APTWVMTAIG EFHDGRTVRA MVGPSRQYL