Gene Franean1_1969 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1969 
Symbol 
ID5670370 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2365089 
End bp2366795 
Gene Length1707 bp 
Protein Length568 aa 
Translation table11 
GC content76% 
IMG OID641240890 
ProductGAF sensor signal transduction histidine kinase 
Protein accessionYP_001506312 
Protein GI158313804 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.197678 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.851655 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGACGTCC CGGACGCGTT CGACGAGTTC GGCGCGGAGC GCCCCGGCCG GCTCGAGCTG 
GACGAGCTGA TGGCGCAGCT CGTCGAGCGC GCGCATGAGG TGATGACCAC CCAGGGCCGG
CTGCGCGGCC TGCTGCGGGC GCACCGCGCG GTCGCCGCCG ACCTGAGCCT GGAGGTCGTC
CTCCGGCGGA TCGCGGAGGC CGCCTGCGAG CTGGTCGACG CCCGCTATGG CGCGCTCGGC
GTGATCGCCC GCGACGGCCG GCTCGAACAG TTCATCCACG TCGGGATGGA CGCCGATCTG
GTCGGCCGGA TCGGCCACCT GCCCCGCGGC GAGGGGGTGC TCGGGCTGCT GACCGCCGAG
CCCCGGGCCG TCCGCCTCGA TGACATCGCC GCGCACGAGC ACGCGGTCGG CTTCCCGCCC
GGCCACCCGC CGATGCGCAC GTTCCTCGGC GTCCCGATCA AGGTCCGCAG CGAGGTCTTC
GGGAACCTCT ACCTGACCGA GAAGCGCGGC GGGCGCCGTT TCACCGCGCA GGACGAGGAG
CTCGTCCTGG CCCTCGCCGC GAGCGCCGGC GTGGCGATCG AGAACGCCCG GCTGTTCGGT
GCGGCGCAGC GCCGCCAGCA GTGGCTGCAG GCATCCGCGG ACATCATGCG CCACCTGCTG
GCGGACGGGC CGGAGCCGCT CACGCTGATC GTCGCGCGGG CCCGCGAGGT CGCCGACGCC
GATCTGGCGT GCGTCCTGCT CGCCGACGGA GCGACCGAGG AGCTGCTCGT CGACGCGGCC
GACGGCCCGC AGGCCGACCG CCTCCTGGGC GAGTCCGTTC CGATGGCCGG AACCCTCGCC
GGCCGGGCGG TCGCGGCCGG GCGGCCCCTG CTGGTCGACG ACGCCGCGGC CGAGCCGGGC
GTCACCGGCT TCGGTGGCCT CGACATCGGC CCGCTGATGG TCATCCCCCT GGTCGGGGCG
CAGGTCGGGA CGGGCGCGGT CGTGCTGGCC CGGGGGCCCG CGGGCCGGCC GTTCGCCGAC
GGCGATCTGG ACATGGCGGC GACGTTCGCC GGGCATGTGC AGGTCGCCCT CGGCCTGGCC
GCGTCCCGGG CCACCCGTGA CCGGCTGCTC GTCCTGGAGG ACCGCGACCG GATCGCCCGC
GACCTGCACG ACCACGTCAT GCAGCGGCTC TACGCCGTCG CGCTGGGTCT GCAGGGGATG
GCGGCCGCCG AGGAGCGCCC GCAGTCCGCC GGCCGGCTCA CCACCTACGT CGACGACCTC
GACGCGACCA TCCGGGAGAT CCGCTCGACG GTCTTCGAGC TGCGCGGGCG GCGCAGCACC
GGCGGGCCGG GCGTGCGGGC CCGCCTCGGC GAGATCGTCG AGGAGGTCGC CGAGGCGCTC
GGCTTCAGCC CGCGCCTGCG GGTGGACGGC CCGCTCGACA CCGCGCTGGA GGGGAACATC
GCCGACCATC TCCTCGCCGT CGCGCGGGAG AGCCTGTCGA ACGTGGCGCG CCACGCCCGC
GCCAGCCGGG TGGAGCTGTC GGTCACCGTC GGCCAGGGCT GGCTGTGCGC CGAGGTCACC
GATGACGGGG TCGGGCTGGG CGACACCGGC CGGCGCAGCG GCCTGCGCAA CCTGCGCAGC
CGCGCCGAGG AGCTCGGCGG GACCTTCGAC ATCGCCCCCG GCCCGTCCGG CGGCACCCGG
CTGCGCTGGG CGGTCCCGCT GCCGTAG
 
Protein sequence
MDVPDAFDEF GAERPGRLEL DELMAQLVER AHEVMTTQGR LRGLLRAHRA VAADLSLEVV 
LRRIAEAACE LVDARYGALG VIARDGRLEQ FIHVGMDADL VGRIGHLPRG EGVLGLLTAE
PRAVRLDDIA AHEHAVGFPP GHPPMRTFLG VPIKVRSEVF GNLYLTEKRG GRRFTAQDEE
LVLALAASAG VAIENARLFG AAQRRQQWLQ ASADIMRHLL ADGPEPLTLI VARAREVADA
DLACVLLADG ATEELLVDAA DGPQADRLLG ESVPMAGTLA GRAVAAGRPL LVDDAAAEPG
VTGFGGLDIG PLMVIPLVGA QVGTGAVVLA RGPAGRPFAD GDLDMAATFA GHVQVALGLA
ASRATRDRLL VLEDRDRIAR DLHDHVMQRL YAVALGLQGM AAAEERPQSA GRLTTYVDDL
DATIREIRST VFELRGRRST GGPGVRARLG EIVEEVAEAL GFSPRLRVDG PLDTALEGNI
ADHLLAVARE SLSNVARHAR ASRVELSVTV GQGWLCAEVT DDGVGLGDTG RRSGLRNLRS
RAEELGGTFD IAPGPSGGTR LRWAVPLP