Gene Franean1_0508 details

Gene Information

Locus tag: Franean1_0508
Symbol:
ID: 5668927
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 589386
End bp: 591407
Gene Length: 2022 bp
Protein Length: 673 aa
Translation table: 11
GC content: 69%
IMG OID: 641239437
Product: putative PAS/PAC sensor protein
Protein accession: YP_001504875
Protein GI: 158312367
COG category: [T] Signal transduction mechanisms
COG ID: [COG4585] Signal transduction histidine kinase
TIGRFAM ID: [TIGR00229] PAS domain S-box
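
The coordinates and lengths listed in this record can be cross-checked with simple arithmetic. The following is a minimal sketch in Python, assuming the start and end coordinates are inclusive and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and are not part of the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a gene given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS whose last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # One codon per residue, minus the terminal stop codon.
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length_bp = gene_length(589386, 591407)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)
```

Under these assumptions the computed values agree with the Gene Length and Protein Length fields.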


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.663118
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACATCAG TATCAGGTGA TGCCCGGCAC GTCGGCGCGG GGAGCTGGGC CGGTGAACAT 
GTCGGTGATG TTGAGGCCGC GGTGCTGCTC GACACGTTGA GCGAGGTGGT GTTCCGCACC
GACGCGGAAG GCCGCTGGAC GTATCTCAAC CGGGCGTGGA CGAGGCTGAC CGGCTTCGAG
ATCGGCCCCA GTCTCGGTGC CCGATTCCTG GACTACGTCC ATCCCGACGA ACTGGAGTAC
ACGGTCGCCC TGTTCATGGG GGTGGTCGCC GGTGGCGCCG ACCACTGCCA CCACGAGACC
CGGTACCGCA CCCGTGACGG CAGCTACCGC CGCGTGCAGA TCAGGGCGAA CGTGATCCGG
GACGGGGACG GCGAGGTGAT CGGTAACACC GGGACGATCG TGGATGTCAC TGACTCCCGT
CGCGGCAGCG AGACGATCGG GGAACACGTC GCCCTGCTCG AACTCGTCTC GGCGGACGCA
CCGGGCGGCG AGCTGCCGGT GGGGGTGGTC GTGTACGACG CTGACCTGCG GCTACGGCGA
GGCTCACCGG TCATCGATCG GCTGGCCGGC GCACCGCAGC GCATCGGTGA CCCCCTCGCC
CGGCTCGTGG AGCGCCTGGC GCCGGCCGAC CCGAGACAAC GCGCCCTGGG CGGCGAATGG
GGGCTGATCG CGGTCGCGAG ACGCACCAAC CACGCCCAGG TCAGCGACCT CGACCTTGCC
GATCCCAGCC GGTCGCTACG GGTATCGGTA ATCCCCTATC TGCGGGACGG GGAGGAGATG
CTCGCGCTCG TCCTGGCCGA CATCACCGAC CTGCGTCGCG CCGAACATCG CCAAGCAGCC
CTGGCCCGGC TCGGCCACCG GGCCATCTCC AGCTCCGACA CCGCCGCGCT GGTGAGTGAA
GCCGCCGGGA TGGTCACCTC GACACTCGAC GTCGCCGGCT GCAAGCTGAT CCACCTCGAC
CCGCGAGGCG GCGACGCAGC CGGCACGAGC CCGCTGCTTG ACCGGGTCCT GACCGACGAC
CAGCCGGTGA TGATCGACAG TCCGGTCAGC GACCTGGCTC GCTGCCTGGC GGGATCCCCG
CCAGGCAGCG ATGTCGCCAC CGTGCTCGCC ACGCGCGTCG GCGGCGCAGG GCTACTGTTC
GGTGTCCTCG CCGCCCACAG CACGACCCCG CGTCACTTCA CCGACGACGA GATCCAGTTC
GTCCAAGCAG TGGCCGACAT CCTCACCGCA GCCCTCGAAC GCGACCGCAC CGACCACGAA
CTGCGCAGAC TCTACCGGCG GGCGGAACAC AGCAAAGCGT GGCTGGCAGC GAGCGCGCAC
ACCATCACCC GGGTCATCGC CGCCACCGAG CCGCGCGACG CCCTCGACCT GATCGCAGCC
ACGGCGAGAA CGATGGCGGG CACCGACATC GGGGTGGTTG CCGTGCCCGA CGAGCACGGC
AACCTGGTGA TAACCACAGC CGACGTCGCC CCCGGCCTGC CAGCTGCTCC TGATTCCCTG
CTCGGGCTGA GCCTCACACA GGGCGTGACC CCGGTGGCCG AGCAGCTCAC GGCCGGTGGC
ACCGTCGTCC TCGACAGCCT CGATCTACGT GGCAGCAGGA TCTCCCCGGC GCCCAACATG
CCCATCGCCA CCGCACTGAT CCTCCCGCTA CGGATCACGC GGCGATCATC GGCCGTGCTC
ATCCTCGGAA ACCACACGGA CACAGCACGC CTCCCGCTCC ACGACATCGA GCTGATCGAG
GCGTTCGCCC ACGACGCCGC GCTCGCCGTC GAACTCATCC AGACCCAACA CGACCGGGCC
CGCCTCGCCG TCTTCCAGGA CCGCGACCGC ATCGCCCGCG ATCTGCACGA CCAGGTCATC
CAACGCCTCT TCGCGATCGG CCTGCACCTG CAGTCCCTGA CCCGAGCCGT CGGCGACATC
GCAGCAGCCC GACTGACCAG CGCCATCAAC GCCCTCGACC ACACCATCGA CGACATCCGC
CACACGATCT TCGACCTGCA CCCCATACAG CCAGGCCAAT AG
 
Protein sequence
MTSVSGDARH VGAGSWAGEH VGDVEAAVLL DTLSEVVFRT DAEGRWTYLN RAWTRLTGFE 
IGPSLGARFL DYVHPDELEY TVALFMGVVA GGADHCHHET RYRTRDGSYR RVQIRANVIR
DGDGEVIGNT GTIVDVTDSR RGSETIGEHV ALLELVSADA PGGELPVGVV VYDADLRLRR
GSPVIDRLAG APQRIGDPLA RLVERLAPAD PRQRALGGEW GLIAVARRTN HAQVSDLDLA
DPSRSLRVSV IPYLRDGEEM LALVLADITD LRRAEHRQAA LARLGHRAIS SSDTAALVSE
AAGMVTSTLD VAGCKLIHLD PRGGDAAGTS PLLDRVLTDD QPVMIDSPVS DLARCLAGSP
PGSDVATVLA TRVGGAGLLF GVLAAHSTTP RHFTDDEIQF VQAVADILTA ALERDRTDHE
LRRLYRRAEH SKAWLAASAH TITRVIAATE PRDALDLIAA TARTMAGTDI GVVAVPDEHG
NLVITTADVA PGLPAAPDSL LGLSLTQGVT PVAEQLTAGG TVVLDSLDLR GSRISPAPNM
PIATALILPL RITRRSSAVL ILGNHTDTAR LPLHDIELIE AFAHDAALAV ELIQTQHDRA
RLAVFQDRDR IARDLHDQVI QRLFAIGLHL QSLTRAVGDI AAARLTSAIN ALDHTIDDIR
HTIFDLHPIQ PGQ
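
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any analysis. The following is a minimal sketch in Python of how the blocks could be joined and a GC fraction computed. The helper names `clean_sequence` and `gc_fraction` are hypothetical, and the sketch does not claim to reproduce how the database derived its GC content field.

```python
def clean_sequence(text: str) -> str:
    """Join a block-formatted sequence into one uppercase string."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence above, used only as a short demonstration.
first_line = "ATGACATCAG TATCAGGTGA TGCCCGGCAC GTCGGCGCGG GGAGCTGGGC CGGTGAACAT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

To check the full gene, paste all lines of the gene sequence into one string and pass it through the same two functions.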