Gene Franean1_7101 details

Gene Information

Locus tag: Franean1_7101
Symbol:
ID: 5675410
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 8669977
End bp: 8671467
Gene Length: 1491 bp
Protein Length: 496 aa
Translation table: 11
GC content: 69%
IMG OID: 641245944
Product: integral membrane sensor signal transduction histidine kinase
Protein accession: YP_001511335
Protein GI: 158318827
COG category: [T] Signal transduction mechanisms
COG ID: [COG4585] Signal transduction histidine kinase
TIGRFAM ID:
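
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive (an assumption that matches the listed gene length) and that the unspliced CDS ends in one stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, excluding the final stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above.
length = gene_length_bp(8669977, 8671467)
print(length)                     # 1491
print(protein_length_aa(length))  # 496
```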


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 16
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGACGGTCA TCTTGCCTTG GCTGCCCGGG TTCCCTGTCG CCGTCTACTG CGTCCGCAGT 
AGACGGCGGT ACTTTGGGGC AGGTTCGACG GCAGGAATCG CATTCCCTCG GTACTGTGGG
TTCGTGTTCG GCTGGTTCTC ACGGTTTCGG GATGTCTGCC AGGGCGGCCT GCGGCGGCGT
CCGAGTCTGG CTGACGTTCT TGTCGCGCTC GTGACCTTCG GCCTGGGAGT CCTGATCGCC
ACCCAGACCG CGACCGGTCA CTCCTGGTCG AGCGGGCTCT ATCTGGTCAC AGGTGCGATC
AGTTGCGCTG CGCTGGTGGC CCGCCGGTGC AGGCCGCTGG TGGTCCTCGG ATTCGTCCTT
GGGTGGTCTT CCCTCGTGGC CGTCGTCGTT GGCAGGCAGG GTGACGGGCC GTCGCTGATC
GCGATCTGGG TCGCGCTGTA CAGCGTCGCG GCGTACCGGC CGCGGAAGAC CGCGTGGATC
GCCGCGTCGG GCACCATCGC TTTACTACTG CCGCCTGCGC TGACGCGCGA CGACGCGGTC
ATCCAGGCCG TCGCGGGGGC GTTCCTGTGG GTCGTCACCG TGACCTCCGC GGGCTGCTCG
GTGCGTGATC GCAGGGCCTA TCAGGCGGCG GTGGAGGACC GGGCGCTGCG CGCCGAGCAG
AGCCGCGAGC AGGAAGCCTG GCGCCGGGTG GTCGAGGAAC GGACGCGGAT CGCCCGGGAG
TTGCACGACG TTGTCGCCCA CCACATCACT CTCGTCAAGT CGCAGGCGGC CGTCGCCTCG
CATCTGCTGT ACGAGCAGCC GGCCAAGGCG GACGAGGCGC TGGCACACAT CCGCCTGTCC
AGCCGCACCG TCCTGAACGA GCTGGGGTCG TTGGTGAGCG TGCTGCGCGA CACCCCCGAG
TCGACATCCA CCGAGCCGCC TCCAGGGCTG GCCCGGCTTC CAGAGCTGAT CTCCGCCTTT
CAAGCGATCG GCATGACGGT GTCCTGCACC GTGACCGCGG CGGGATCCGG GTTGCCCCCA
CTGGTGGATC TCGCCACCTA TCGGATCGTG CAGGAGTCGC TGACGAACAT CCGCAGGCAC
GCTCCCGATG CCCGGACGTC GGTCCAGGTC GACCGTTTCC GCACCGAAGT GCGCCTGCGC
ATCCACAACA CGAGGCCCGG CGGCGTCGCA CCTACCGTCG GCTACTCACG CCTCATGACC
CCCGGCTGGG ACGGGACGGC GGAAACGGGA CGCACTCCAG GTGGCCACGG TATCGCGGGG
ATGCGAGAAA GAGCGCGGAC CGTCGGTGGC TGGCTGGACG CCGGTCCCGA CGCGGAGGGG
GGCTTCGCTG TCACCGCGAT GTTGCCGGTC CCGAAGGAGG CGTCCGCCTC CTCACCACCA
TCCGCCGCAC CATCGCCACC GAGCTCGACG TGGTCACCGA TGGTGCCACC CCCCACCGCG
CCGTCGATGT CGGCCCCCGC ACCCGTCGCT CCCAGCCACG CACCGTCGTA G
 
Protein sequence
MTVILPWLPG FPVAVYCVRS RRRYFGAGST AGIAFPRYCG FVFGWFSRFR DVCQGGLRRR 
PSLADVLVAL VTFGLGVLIA TQTATGHSWS SGLYLVTGAI SCAALVARRC RPLVVLGFVL
GWSSLVAVVV GRQGDGPSLI AIWVALYSVA AYRPRKTAWI AASGTIALLL PPALTRDDAV
IQAVAGAFLW VVTVTSAGCS VRDRRAYQAA VEDRALRAEQ SREQEAWRRV VEERTRIARE
LHDVVAHHIT LVKSQAAVAS HLLYEQPAKA DEALAHIRLS SRTVLNELGS LVSVLRDTPE
STSTEPPPGL ARLPELISAF QAIGMTVSCT VTAAGSGLPP LVDLATYRIV QESLTNIRRH
APDARTSVQV DRFRTEVRLR IHNTRPGGVA PTVGYSRLMT PGWDGTAETG RTPGGHGIAG
MRERARTVGG WLDAGPDAEG GFAVTAMLPV PKEASASSPP SAAPSPPSST WSPMVPPPTA
PSMSAPAPVA PSHAPS
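
The two sequence blocks can be related to each other and to the GC content field with a minimal sketch like the one below. It assumes the sequences are pasted as plain text in the 10-character groups shown above, that translation table 11 uses the standard codon assignments, and that the first codon of a complete CDS is read as the initiator methionine even when it is GTG, as it is here. The helper names are hypothetical.

```python
# Codon table in the usual T, C, A, G ordering (standard assignments,
# assumed here to also apply to translation table 11).
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display; upper-case the rest."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return (s.count("G") + s.count("C")) / len(s)


def translate_cds(seq: str) -> str:
    """Translate a complete CDS, stopping at the first stop codon.

    Assumption: the first codon is an initiator and is written as M.
    """
    s = clean(seq)
    residues = []
    for pos in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[pos:pos + 3]]
        if aa == "*":
            break
        residues.append("M" if pos == 0 else aa)
    return "".join(residues)


# First 30 bases of the gene sequence above.
print(translate_cds("GTGACGGTCA TCTTGCCTTG GCTGCCCGGG"))  # MTVILPWLPG
```

Passing the full gene sequence to `gc_fraction` and `translate_cds` gives values to compare against the GC content field and the protein sequence listed in the record.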