Gene Franean1_5401 details


Gene Information

Locus tag: Franean1_5401
Symbol:
ID: 5673732
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6517219
End bp: 6518538
Gene Length: 1320 bp
Protein Length: 439 aa
Translation table: 11
GC content: 73%
IMG OID: 641244256
Product: integral membrane sensor signal transduction histidine kinase
Protein accession: YP_001509662
Protein GI: 158317154
COG category: [T] Signal transduction mechanisms
COG ID: [COG4585] Signal transduction histidine kinase
TIGRFAM ID:
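
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: the function names are hypothetical, and it assumes the start and end coordinates are 1-based and inclusive and that the CDS is unspliced and ends in a single stop codon.

```python
# Values copied from the Gene Information fields above.
START_BP = 6517219
END_BP = 6518538
GENE_LENGTH_BP = 1320
PROTEIN_LENGTH_AA = 439


def span_length(start: int, end: int) -> int:
    """Length of a coordinate range, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons minus one stop codon (assumes an unspliced CDS)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


# Both comparisons should hold if the record is internally consistent.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions, 6518538 - 6517219 + 1 gives 1320 bp, and 1320 / 3 - 1 gives 439 aa, which agrees with the listed lengths.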


Plasmid Coverage information

Num covering plasmid clones: 25
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 19
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGACCATCG AGGTGGTCGG GTTTGGGGGG CGCTGGGCCC GCGTGCGGGC GGCCGTCCAG 
GCGCGGGAGC CGTGGCCCCG CGACCGGCGC GCGCTGGTGT TCGACCTCGT GGTCGCGCTA
GCCGCCACGG TCGCCGAGCT CAGCCTCCTC CTCAACGACG ACCACACTGT GCGTGCGCCG
ATGGTGCTGC TAGCGGTCGC GGCCGGCGGT GCGCTCACCG CGCGTCGCCG GGCACCGTGG
ACGGTGCTTG TGATGACGTT GGCCCTGTCC GGAGCGCTGG TGGCCATCGG TGACGCGCCC
GGCGGGGTGC CCGTCCTGGT GGCGCTCTAC ACGGTCGCCG ACCTCGACGA CTGGCGCCTC
TCGCTCGCCG CGCTCGCGCC CACCGCGGTG CTCCTGACCA TGTTGTCCAT CGTCTCCGTC
CCGCCGACGG CGGGGGTGTG GGCGCTGGGC GCCTACGCGC AGACCCGCCG CCGCTACGTC
CGCGCGCTCG AGGAACGCGC GGAGCACCTT CAACGCGAAC GGGAGCAACT AGCCCGGATC
GCGGTGCACG AGGAGCGGGC GTCGATCGCC CGCGAGCTGC ACGACATCGT CGCTCACTCG
GTGACCGTGA TGCTGCTGGG CGTGCGTGGC GCCCGCGACG TGCTGCGCGT CTCCCCAGAC
CGAGCCGACG ACACACTGGC GCGGGTGGAG ACGAACGGGG AGCAGAGCCT CGTCGAGCTG
CGGCGGATAC TGACCGTGCT GCGTGCTCCC GACACCCCCG CCGACTCACG TCCCGCCCCA
TCTCTGACGG AACTGGACGA GCTCGTCGTC GACTACCGTG ACGCCGGGCT GCCCATCCAC
CTACGGGTGA CGGGGGAGCG AAGACCGCTT CCCGGCGGCG TGGAGCTCTG CGTCTACCGC
GTCATCCAGG AGGCGTTGAC CAACGCGCTG AAGCACTCGC GCCCCAGGCA TGTCACGGTC
ACGCTGGCCT TCCTGGGCTG GTGCCTCGAC GTCGAGGTGG CCAACGACAG CACGGCCCCA
GCGCCGGGGC CGACCGGCGA TGGCGCCGGG GATATGCCCG AGACCGGGAA CCCCGCTGGG
CACGGCCTTA TCGGGATGCG CGAGCGGGTT GGGGTGCTCG GCGGCGAGCT GGAGTTCGGG
CACCGACCCG GCGGCGGCTT CCGCGTCGCG GCCCGCCTGC CGCTGGGCGG CGGAGCTGCA
CGCCGCCAGC TACGCCGGCC GCGTCGACGT GGTCCGTCTG CTGCTCGACC GCGGCGCCGA
CATCGACGCC GCGGACCGGC ATACGTGCCC AAGGGATCCG CGCGACCGTC GAGGAGATGA
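
The gene sequence is displayed in space-separated blocks of 10 bases, so it has to be cleaned before use. The following is a minimal sketch, with hypothetical helper names, of stripping that layout and computing a GC percentage. It is not necessarily how the database derived its own GC content figure.

```python
def clean_sequence(text: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase the bases."""
    return "".join(text.split()).upper()


def gc_content(text: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(text)
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return 100.0 * gc_count / len(seq)


# Only the first displayed row is used here; paste all rows to evaluate the whole gene.
first_row = "ATGACCATCG AGGTGGTCGG GTTTGGGGGG CGCTGGGCCC GCGTGCGGGC GGCCGTCCAG"
print(len(clean_sequence(first_row)))
print(round(gc_content(first_row), 1))
```

Applied to all rows, the cleaned sequence length can be compared with the listed gene length and the rounded percentage with the listed GC content.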
 
Protein sequence
MTIEVVGFGG RWARVRAAVQ AREPWPRDRR ALVFDLVVAL AATVAELSLL LNDDHTVRAP 
MVLLAVAAGG ALTARRRAPW TVLVMTLALS GALVAIGDAP GGVPVLVALY TVADLDDWRL
SLAALAPTAV LLTMLSIVSV PPTAGVWALG AYAQTRRRYV RALEERAEHL QREREQLARI
AVHEERASIA RELHDIVAHS VTVMLLGVRG ARDVLRVSPD RADDTLARVE TNGEQSLVEL
RRILTVLRAP DTPADSRPAP SLTELDELVV DYRDAGLPIH LRVTGERRPL PGGVELCVYR
VIQEALTNAL KHSRPRHVTV TLAFLGWCLD VEVANDSTAP APGPTGDGAG DMPETGNPAG
HGLIGMRERV GVLGGELEFG HRPGGGFRVA ARLPLGGGAA RRQLRRPRRR GPSAARPRRR
HRRRGPAYVP KGSARPSRR
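
The protein sequence can be compared against a translation of the gene sequence. The sketch below is a hand-rolled illustration rather than the database's own method: it builds the codon table from the amino acid string published for NCBI translation table 11 (which assigns the same amino acids as the standard table) and translates up to the first stop codon. The names `CODON_TABLE` and `translate` are hypothetical, and the alternative start codons of table 11 are not handled, which does not matter here because the gene starts with ATG.

```python
from itertools import product

# NCBI ordering: bases T, C, A, G at each codon position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(text: str) -> str:
    """Translate a nucleotide sequence in frame 1, stopping at the first stop codon."""
    seq = "".join(text.split()).upper()
    protein = []
    # Ignore any trailing bases that do not form a complete codon.
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE.get(seq[i:i + 3], "X")  # "X" marks an unrecognized codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence shown above.
print(translate("ATGACCATCG AGGTGGTCGG GTTTGGGGGG"))  # MTIEVVGFGG
```

The output matches the first block of the protein sequence. Translating the full gene sequence the same way allows a residue-by-residue comparison with the 439 aa protein.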