Gene Franean1_6522 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6522 
Symbol 
ID5674837 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp7929300 
End bp7930970 
Gene Length1671 bp 
Protein Length556 aa 
Translation table11 
GC content72% 
IMG OID641245370 
Productmulti-sensor signal transduction histidine kinase 
Protein accessionYP_001510765 
Protein GI158318257 
COG category[T] Signal transduction mechanisms 
COG ID[COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase) 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0707299 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGCGGCT GGCGGCTGCG CCGGCGGCTC GCCGCGGTGT ACATCGCGTT GTCCATGGTG 
CTGCTGCTGG CCGCCGGGAT CGTCGCCTGG TCGCTGGTCC GGCTCGACGA CGCCATCCAT
GTGCGCAGCG ACGTCCTGGC GCCCGCGCAG ACAGTCACGG TCCGGTTGAC GTCCAGCCTG
GTCGACCAGG AGACCGGCAT GCGCGGGTAT CTCCTGCAGG GCGACCCCGC GTTCCTGACC
CCGTACGAGA CCGGGCGGCA GTCCGAGGCG AGGCTGAGCT CCGTCCTCGT CGGCCGGGTG
GCCGCCCGCC CGGAGCTCGA GCGGCTGCTC GGGGTCCTCC AGGCGCGGAT CGCGGACTGG
CACCGCGACT ACGCCGACCC GTCCATCGCG GCGGTCCGCC GGGGGGGCAC CGACTCGCCG
GGAATCCTCG ACCCGGTTCT CGGCAAGACC CGGTTCGACG CGGTGCGCGC CGCGGCCGCG
GACTTCGACG ACGCCGTCCG GATCGCCGCG CTGGCGGCCC GTGAGGACGT CACCAACGCA
CTGCGGACGC TCATCATCTC GCTCGTCGTG GCCGCGCTGG CGCTCATCGT GCTGCTGGCA
CTCATCTTCC GAGCGCTGCG GGTGTGGGTG ACGCGGCCGC TGGAGGCCGT CGGCACGGAC
GTCCGCGAGG TCGCCGCCGG GCACCTGGAG CACGCGGTGG AGTCGGTGGG GCCGCCCGAC
ATCGTCACCC TGGCACGGGA CGTCGAATCG ATGCGCCTGC AGCTCATCGG TGAGCTCGAG
GCCGCGCGCA GGGCGCGGCT GTCCACCGAG GCGCAGGCGG AGATCCTGCG CCGGTCCAAC
CGCGACCTGG AGCAGTTCGC CTACGTCGCC TCGCACGACC TGCAGGAGCC GTTGCGCAAG
GTGGCCAGCT TCTGCCAGCT TCTGTCCCGG CGCTACGGCG ACCAGCTCGA CGAGCGCGGG
ACCCAGTACA TCCACTTCGC CGTCGACGGC GCGAAGCGCA TGCAGCAGCT CATCAACGAC
CTGCTCGCCT TCTCCAGGGT CGGCCGCACC ACCGACAGTT TCGTCGACGT CCCGTTGGGC
GAGGTCTTCG AGCGGGTGGT GGGGACGCTG TCGCTCGCCC TGGAGAGCTC CGGCGGCGAG
GTGACCGCCA GCGATCTCCC GGCGGTGCCC GGCGACCCCG TTCTGCTGGC CCAGCTCCTG
CAGAACCTGA TCGGCAACGC GCTGAAGTTC CGTGGCGACG AGGCCCCGCG GGTGCGGATC
GGGGCGATCG ACCGCGACGT CGAGTGGGAG CTGTTCTGCG CGGACAACGG CATCGGCATC
GAGCCGGAGT ACGCCGAGAA GATCTTCGTG ATCTTCCAGC GCCTGCACGC GCGGGACGTC
TACGAGGGCA CCGGCATCGG TCTGGCGCTG TGCCGCAAGA TCGTCGAGTT CCACGGTGGC
CGGATCTGGC TCGACGGTAC GGTGGAGTCC GGGACCGCCT TCCGCTGGAC GCTCCCGAAA
CGCCCGGCCG TGGTCCTGCC CGCCGACCTT GAGGCGGACG GGGCGCCGCC GGCCTCGGTC
ACCGGTCCTG TCTCGTCCGC CGGTCCCGGT CCCGGTTCGG TTGCTGGTCC TGTCGTGGTT
GCTGGTCCCG TCGTGGCCGC CGTTCCGCCG AAGTCAGGAG ATCGTTCGTG A
 
Protein sequence
MRGWRLRRRL AAVYIALSMV LLLAAGIVAW SLVRLDDAIH VRSDVLAPAQ TVTVRLTSSL 
VDQETGMRGY LLQGDPAFLT PYETGRQSEA RLSSVLVGRV AARPELERLL GVLQARIADW
HRDYADPSIA AVRRGGTDSP GILDPVLGKT RFDAVRAAAA DFDDAVRIAA LAAREDVTNA
LRTLIISLVV AALALIVLLA LIFRALRVWV TRPLEAVGTD VREVAAGHLE HAVESVGPPD
IVTLARDVES MRLQLIGELE AARRARLSTE AQAEILRRSN RDLEQFAYVA SHDLQEPLRK
VASFCQLLSR RYGDQLDERG TQYIHFAVDG AKRMQQLIND LLAFSRVGRT TDSFVDVPLG
EVFERVVGTL SLALESSGGE VTASDLPAVP GDPVLLAQLL QNLIGNALKF RGDEAPRVRI
GAIDRDVEWE LFCADNGIGI EPEYAEKIFV IFQRLHARDV YEGTGIGLAL CRKIVEFHGG
RIWLDGTVES GTAFRWTLPK RPAVVLPADL EADGAPPASV TGPVSSAGPG PGSVAGPVVV
AGPVVAAVPP KSGDRS