Gene Franean1_1653 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1653 
Symbol 
ID5670055 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1973530 
End bp1974942 
Gene Length1413 bp 
Protein Length470 aa 
Translation table11 
GC content75% 
IMG OID641240571 
Producthistidine kinase 
Protein accessionYP_001505997 
Protein GI158313489 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.120567 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGACGGAC GCCGCTCTTG CCAGACTGGG CCGGTGATCT GGGCGGCGAT CCGGCGGCAT 
CGGGAACGGG CCGGCCCGGC CAGCCTCACC GTCTGGGCGT GGGTGGCCGA CGCGATCCTC
GCCCTGGTGC TGGCCCTGGG CGTGGTGACC GCCATCAACG GTGACCGCCT CCTCGCGGTG
CCCGCACCGG GGACCTCGCA GTCGGCACGC GTCGAACGGC CGAGTCGGCC GGTCCTCCCG
CTCACACGGG TGACCCCGGT ACCACCAGTG GAACCGGTAC CGTCGGTGGC CCCGGAACCG
CGGGAGGTGC CGTCACTCGC GGGCCCGGGT CAGGTGTTCC CATCGCAGTT CGTCGCCGTC
CAGACGCCGG TCGCCGACTG GCAGCTGGCG CTGGCCGTCC TCGCCGCGCT ACCGCTGGCC
GCGCGGCGGC GGTTCCCGCT GGCGACCTAC GGGTGCGTCA TCGCCGCCGC GTCGTTACTC
CAGCTGGACC TGCACATCGA TGACCTGACG ACGTTCACCT TCACGGCCTG CCTGATAGCC
GGGTACAGCG CGGCGATGTA CAGCCCGCAC CGGGCGTGGG CGGCGGCGGG CATCGCCGGC
GGTGCCGTGC TGCTCGCGCT CGGCCACCAC ACGACGATCC CCAGGATCAG CTCTGACTAC
TTTCCCTTCC TGGTCCTGAT CCCGCTCGGC CTCGCGGCGA ACGCTGCCCA CACCCGCACT
CAGCGCGCGC GGGTGCAGGA GGCGGAACGG GCCGCGGCGA GCCGGCAGGC CACCGATCAG
GAGCGGGCCC GCATCGCACG CGAGCTGCAC GACGTCGTCA CCCACAATGT GAGCGTGATG
GTGATCCAGG CCGGCGCCGC CCGCAAGGTG CTCGACGCCA ACCCCGATCT GGCGCGCGAG
GCCATCCGTG CGGTCGAGAC CAGCGGCCGC AGCGCGATGA CCGAGCTGCG TCACGTGATG
GGCCTGCTGA CGATGAGTGG CGGTGCCCCG GATGGCGGGG ACGACGGCCC GGGCCCCAGC
CCACAGCCGG GGCTCGGCCG GGTCGACGAG CTGGTGCGGC GCGTTCGCGA CACCGGAGTG
GACGTCGAGC TGAGTACGAC GGGAACGCCG GTGCCGTTGC CCGCCGGCGT CGACCTGGCC
GCGTTCCGCG TGGTGCAGGA GGCCCTGACC AATGCCGTCA GGCACGCCGC CGGGGCGCGG
GTCCGGGTCG CCGTGGCCTA CGCACCGGGT CTGGTCCGGG TCGAGGTGAC CGACAGCGGC
GGCGTCCGGG CGGCGGCCGC CGGGGCGGGC AGCGGCGCCG GGCTGCTCGG CCTGCGGGAA
CGGCTCGCCG TCTACGGCGG CACGCTGGCG GCGGGCCCGC GCCCGACCGG CGGGTACCGG
GTGATGGCCG AGATCCCGCT GGACGGCCCG TGA
 
Protein sequence
MDGRRSCQTG PVIWAAIRRH RERAGPASLT VWAWVADAIL ALVLALGVVT AINGDRLLAV 
PAPGTSQSAR VERPSRPVLP LTRVTPVPPV EPVPSVAPEP REVPSLAGPG QVFPSQFVAV
QTPVADWQLA LAVLAALPLA ARRRFPLATY GCVIAAASLL QLDLHIDDLT TFTFTACLIA
GYSAAMYSPH RAWAAAGIAG GAVLLALGHH TTIPRISSDY FPFLVLIPLG LAANAAHTRT
QRARVQEAER AAASRQATDQ ERARIARELH DVVTHNVSVM VIQAGAARKV LDANPDLARE
AIRAVETSGR SAMTELRHVM GLLTMSGGAP DGGDDGPGPS PQPGLGRVDE LVRRVRDTGV
DVELSTTGTP VPLPAGVDLA AFRVVQEALT NAVRHAAGAR VRVAVAYAPG LVRVEVTDSG
GVRAAAAGAG SGAGLLGLRE RLAVYGGTLA AGPRPTGGYR VMAEIPLDGP