Gene Franean1_6497 details


Gene Information

Locus tag: Franean1_6497
Symbol:
ID: 5674812
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7898351
End bp: 7900189
Gene Length: 1839 bp
Protein Length: 612 aa
Translation table: 11
GC content: 76%
IMG OID: 641245345
Product: integral membrane sensor signal transduction histidine kinase
Protein accession: YP_001510740
Protein GI: 158318232
COG category: [T] Signal transduction mechanisms
COG ID: [COG0642] Signal transduction histidine kinase
TIGRFAM ID:
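
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that does not count toward the protein length; both are assumptions about this record's conventions, not statements from the record itself. The variable names are illustrative.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 7898351
end_bp = 7900189

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and ends in a stop codon,
# which is not counted in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, "bp")     # compare with the Gene Length field
print(protein_length, "aa")  # compare with the Protein Length field
```

Under these assumptions the computed values agree with the listed Gene Length (1839 bp) and Protein Length (612 aa).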


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0417702
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value: 0.376991
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGATGA CCGGCACCAC CGGCACCCAC CGGCGGCGCG GCCCGGCCGG CTGGTCGCTG 
CGTACCAGGC TGCTCGCGCT GCTCCTCGCG CTGCTCGCCG TGATGTCCAC GGTGATCGTC
TTCGTGACGG CAGTCGCGTT GCGCGGGGTG CTCGTCCAGC AGCTCGACGA CCGGTTGGAG
GACGCGAGCC GCCGCTGGGG GCTGACCAGC CAGGCCGGAC CGGTCATGCT GCCCGGCTAT
GGAAACGCCG GCCCGGGGGG TGCCGGAACT CCCAGCCTGG CCCAGCCGCC CACGCCGGAC
CTGACCGCCG ACGAGCGGGT GGACTGGTTC CTGGGCTTCC GCGGTCAGCC GTCGGGGACC
CTCGGCGCGG TCTTCAAGCT GCGGGATCCC GACACCACCG CCCAGAGCTC GGCGCGCGAG
GCGCTGCGGG GCGCGCCACC GCCGGGATCG GCGGCCCCGG ACCTGACCTC CCTGATCGAG
AGCGCCGCCG TCGCCGTCAT CGACGACAAC GGCGACGTGC AGCCGCTCAC CGGCGTGGAC
GCCGCCGTGG TGGCCCAGAT CCCGCTCGAC GGCCAGGAAC ACACCCGCGC GATCGCCGAC
CAGGGCGACT ACCGCCTCAT CGCCCACTTC GACCCCAACG GGCAGACGGT GCTCGTCACC
GGCGTGCCGA TGAACGAGGT CAGCGCGACC CTGACCGAGG TCGGGATCGC CGGAGCGATC
GTGGCGGCGC TGGGGGTGCT GGCCGCGGGC ATCGCGGGCG CGGCGATCAT CCGGGTCACG
CTGCGGCCGC TGAACCGGGT CGCGGCGACG GCGAGCCGGG TCGCCGAGCT CCCGCTCGAC
CGGGGGGCCG TCGACCTCAT GGCGGTGGCA CCGGACGTCG ACACCGACCC CCGGACCGAG
GTCGGGCAGG TCGGCGCGGC GCTGAACCGG ATGCTCGGCC ACATCGGGGC GGCGTTCTCG
GCCCGGCACG CGAGCGAGAC GCGGATGCGC CAGTTCCTCG CCGACGCCAG CCACGAGCTG
CGCACACCGC TGGCGGCGAT CAGCGGCTAC GCCGAGCTCA CCCGGCGCAC CCGCGACACC
GTCCCGCCGG ACGTCGCCTA CGCGATGGGC CGGGTCGAGT CGGAGAGCGC GCGGATGACC
GCGCTCGTCT CCGACCTGCT GCTGCTGGCC CGCCTCGACT CCGGACGCCC GCTCGTGAGG
GAGACCGTCG ACCTCTCCCG GCTCGTGGTG GACGCGGTCA GCGACGCCCA CGTGGCCGCG
CCCGAGCACC GCTTCGAGCT GGACCTGCCC AGCATGCCCG TGACCGTGGC GGGCGATCCG
GCCCGGCTGC ACCAGGTCCT CGCCAACCTG CTCGCCAACG CCCGCACGCA CACCCCGCCG
GGCAGCCGGG TGACGGCCCG GCTGACCGTC GAGCCGGGAC CTTCCGCTCC GGGGCCCTCC
GCTCCGGTGG CGAACGAGCC TGGCGCGGGC CGGCCGTCCG CGGTGCTCTC GGTGATCGAC
GACGGGCCGG GCATCCCGGA CGCACTGCTT CCGAAGGTGT TCGAGCGCTT CGCGCGGGGG
GACAGTTCGC GTTCCCGCGC GGCCGGCAGC ACCGGCCTCG GGCTGTCCAT CGTGGCGGCG
GTGGTCGAGG CGCACCATGG ACGGGTCTCG GCCGCGAGCC GTCCCGGCCG CACCGCCTTC
ACCGTGGTCC TGCCCCTGGA CACCGCCGAC CCGGGCGCCC CGGACCCGGG CGCCCTCGAC
CCACCGGGCG CCCTCGACCC ACCGGGCGCC GTCGACGGCT GCGCGCCCGA CGCCGCCGAT
CCCGCCGTGA CGGCCGCGGG GACCCGCCCT CACAGGTGA
 
Protein sequence
MTMTGTTGTH RRRGPAGWSL RTRLLALLLA LLAVMSTVIV FVTAVALRGV LVQQLDDRLE 
DASRRWGLTS QAGPVMLPGY GNAGPGGAGT PSLAQPPTPD LTADERVDWF LGFRGQPSGT
LGAVFKLRDP DTTAQSSARE ALRGAPPPGS AAPDLTSLIE SAAVAVIDDN GDVQPLTGVD
AAVVAQIPLD GQEHTRAIAD QGDYRLIAHF DPNGQTVLVT GVPMNEVSAT LTEVGIAGAI
VAALGVLAAG IAGAAIIRVT LRPLNRVAAT ASRVAELPLD RGAVDLMAVA PDVDTDPRTE
VGQVGAALNR MLGHIGAAFS ARHASETRMR QFLADASHEL RTPLAAISGY AELTRRTRDT
VPPDVAYAMG RVESESARMT ALVSDLLLLA RLDSGRPLVR ETVDLSRLVV DAVSDAHVAA
PEHRFELDLP SMPVTVAGDP ARLHQVLANL LANARTHTPP GSRVTARLTV EPGPSAPGPS
APVANEPGAG RPSAVLSVID DGPGIPDALL PKVFERFARG DSSRSRAAGS TGLGLSIVAA
VVEAHHGRVS AASRPGRTAF TVVLPLDTAD PGAPDPGALD PPGALDPPGA VDGCAPDAAD
PAVTAAGTRP HR
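
The gene sequence begins with GTG, yet the protein sequence begins with M. This is consistent with translation table 11 (the bacterial, archaeal and plant plastid code), in which GTG is one of the permitted alternative start codons and the initiating codon is read as methionine. The following is a minimal sketch of that rule, not the pipeline used to produce this record. The function name `translate_cds` and the constant names are hypothetical; the codon table is the standard amino-acid assignment, which table 11 shares, and the start-codon set is the one NCBI lists for table 11.

```python
from itertools import product

# Standard amino-acid assignments (shared by translation table 11),
# with codons enumerated in T, C, A, G order at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons permitted in translation table 11.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading a valid first codon as Met."""
    dna = "".join(dna.split()).upper()  # drop the spaces between 10-base blocks
    usable = len(dna) - len(dna) % 3    # ignore any trailing partial codon
    protein = []
    for index in range(0, usable, 3):
        codon = dna[index:index + 3]
        amino_acid = CODON_TABLE[codon]
        if index == 0 and codon in START_CODONS:
            amino_acid = "M"            # initiator codon is translated as Met
        if amino_acid == "*":
            break                       # stop codon ends translation
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGACGATGA CCGGCACCAC CGGCACCCAC"))  # MTMTGTTGTH
```

The output matches the first ten residues of the protein sequence. Note that the function treats the first codon of whatever fragment it is given as the initiator, so it should be applied to a sequence that starts at the annotated start of the CDS.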