Gene Franean1_6937 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_6937 
Symbol 
ID5675250 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp8451033 
End bp8452775 
Gene Length1743 bp 
Protein Length580 aa 
Translation table11 
GC content72% 
IMG OID641245786 
ProductGAF sensor signal transduction histidine kinase 
Protein accessionYP_001511177 
Protein GI158318669 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.399913 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTCCCCA CGGTGGCCCG TCTCGAACTC GACGACCTGC TCAGCCAACT CGTCGACCGC 
GCCCAGGACG TTCTCGCCAC CCAGGGCCGG CTGCGCGGCC TGCTGCACGC GAACCGGGTC
ATCGCCACCG ACCTGCGACT GCCTGTCCTA CTCCGGCACA TCGTCGAGGC CGCAACGGAT
CTGCTGGGTG CCCGCTACGG CGCACTCGGC GTCGTGGCCC CCGACCGCAC ACTCGAGGAA
TTCGTCCACG TCGGCATGAC CGACGCGGAC GTGGAGCGGA TCGGCCACCT GCCCACCGGT
CACGGCCTGC TCGGCATCCT GATCGACGAC CCGCGGCCGC GCCGCGCCGA CGACATCGCC
CATGACCCGG CGTCCCAGGG CTTCCCGGCC GGGCATCCGC CGATGCGGAC CTTCCTCGGC
GTCCCGATCA CTGTGCGGGG CGAGGTGTTC GGCAATCTCT ACCTCACCGA CAAACGCGAC
GGCGTCCCCT TCACCGCGGA GGACGAGGAG CTCGCCCAGG CCCTGGCCGC CAACGCCGGG
GTGGCGATCG CGAACGCGCG GCTCTACCAC GAGGCGCAGC AGCGGCACCT GTGGATGACC
GCCTCGGCGG AGATCAGCCG TCAGGTGATG GTCGGCGCCG ACAACGCGCT CGCCACCCTC
GTGCACCGGG TGCAGGAGGT CGCCGACGCC CCCTTCGTCG CCCTCGCGCT GCACACCACG
AACCAGGGCA CTGCGGACGG AGGCGATCGG AGCAAGGAGG CCGGGTACGC GCGCGTCGCC
GTCGCCGTCA CGGAGCGTCA CGTTACCGGC TCGGCCCACG CCGGTCTGGG CACCGATGCC
AGCCGTGCCG GCCGCCTGAT TCCCCTCGAG CACACCCTGA CCGGCCGGGT GATCGCCGAG
CAGCAGGCTC TTCGCGTCGA CGACTCGGAG CTCGACGCGC TCCCCGACGA GCGCGCGGCA
CGCACCGGGC CGCTCATGGT CGTCCCGCTC GTCGCCGGCG GCGACCAGTG CGGCGGAGCG
CTGCTCATCG GCCGCGACCG CGGCGTCCGC GCCTTCACCG ACGGCGACCT CGACATGGCG
GCAGGCTTCG CCGGCCACGT CGCCGTAGCC CTCGAGCTCG CCCGGGCCCG AGCCGACCAG
GAACACCTGC GGGTACTCGC CGACCGCGGC CGGATCGCCC GTGACCTGCA CGACCACGTC
ATCCAGCGGA TGTTCGCCGT CGCGCTCGGC ATGCAGGATC TCGCCCAGTA CGAGAACCCC
TCCAACGCCG GCCGGCTCAA CGGCTACGTC GAGGACATCG ACGCGACCAT CAAGGACATC
CGCCGTTCCA TCTTCGAGCT GCGCGGACAG AGCCCCACCA AGCGCGGTCG CCTGCGCGCC
GGCCTCAACA AGATCGCGGA CGACGTTCGG CTGGCCCTCG GCTTCGCCCC CGCCATCTCC
CTGACCGGGC CCCTCGACAC CGTCGCGGAC GACCAGCTCA CCGACCATCT GCTCGCCGTC
ACCCGCGAGG CCCTCACGAA CACCGCCCGC CACGCCCACG CGACCAGCGT CGAGGTGCGG
CTGGCCGTGG ACGGGGACAT GGTCACCCTG GACGCCGTCG ACAACGGGGT CGGCATCGGT
GACACCACCC GCCGCAGCGG CCTGGACAAC CTGCGCGCCC GCGCCGAGAG CCTCGGCGGC
ACCTTCACCG CCACGACACC GCCCACCGGC GGCACCCACC TCCGCTGGGC CGCCCCGTTC
TGA
 
Protein sequence
MFPTVARLEL DDLLSQLVDR AQDVLATQGR LRGLLHANRV IATDLRLPVL LRHIVEAATD 
LLGARYGALG VVAPDRTLEE FVHVGMTDAD VERIGHLPTG HGLLGILIDD PRPRRADDIA
HDPASQGFPA GHPPMRTFLG VPITVRGEVF GNLYLTDKRD GVPFTAEDEE LAQALAANAG
VAIANARLYH EAQQRHLWMT ASAEISRQVM VGADNALATL VHRVQEVADA PFVALALHTT
NQGTADGGDR SKEAGYARVA VAVTERHVTG SAHAGLGTDA SRAGRLIPLE HTLTGRVIAE
QQALRVDDSE LDALPDERAA RTGPLMVVPL VAGGDQCGGA LLIGRDRGVR AFTDGDLDMA
AGFAGHVAVA LELARARADQ EHLRVLADRG RIARDLHDHV IQRMFAVALG MQDLAQYENP
SNAGRLNGYV EDIDATIKDI RRSIFELRGQ SPTKRGRLRA GLNKIADDVR LALGFAPAIS
LTGPLDTVAD DQLTDHLLAV TREALTNTAR HAHATSVEVR LAVDGDMVTL DAVDNGVGIG
DTTRRSGLDN LRARAESLGG TFTATTPPTG GTHLRWAAPF