Gene Francci3_2154 details


Gene Information

Locus tag: Francci3_2154
Symbol:
ID: 3905544
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 2522541
End bp: 2524259
Gene Length: 1719 bp
Protein Length: 572 aa
Translation table: 11
GC content: 69%
IMG OID: 637879489
Product: periplasmic sensor signal transduction histidine kinase
Protein accession: YP_481255
Protein GI: 86740855
COG category: [T] Signal transduction mechanisms
COG ID: [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase)
TIGRFAM ID:
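
The coordinate and length fields in this record are mutually consistent, which can be checked with a few lines of arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive coordinates and that the final codon of the CDS is a stop codon; the function names are hypothetical and not part of any database API.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 2522541
END_BP = 2524259


def gene_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def protein_length(cds_length: int) -> int:
    """Residue count for a CDS whose last codon is a stop codon."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length // 3 - 1  # drop the terminal stop codon


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1719 572
```

Both results agree with the Gene Length (1719 bp) and Protein Length (572 aa) fields.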


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 10
Fosmid unclonability p-value: 0.32555
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAACCCAG AGGGTCGCGG ACCGGCCAAG CCGGTGCGGG TGGGTGCGGG CAGCGGGACA 
CCGGCCTGGC CCGCCGGCGA CGGTGCGCAC GCGTCGCGAC TTGAGACCGT CGCATCCTGG
GCCGCCGTGT TCACGATCGT GATGGCCGCG GTCACGCTAC TCGGTTGGAC CCTGGACAAC
CACGTGCTCA CGAGCATCAA ACCCGTCTGG CCGTCGACGA AACCAAACGG AGCCGTCTGC
GGTCTGCTGC TCGGGTTCTC GCTGTGGAGC CGACTCGGCC GGCGGCGAAC CGCCGTCCGG
GCGGCGGGAC AGGGGGCCGC CGCCCTTGCA CTGGCGATCT CCGGACTCTC TCTCGTCGAG
TCGGCCGTGG ATCGGAATTT CGGCATAGAC GAACTTCTCT TCATCGACCA GGCCACAGCC
TCGCAGTCTC ATCCGGGGCG GATCGTGCCT TTCGGCGCGG CCATTCTGGC GCTGTTGTCG
CTCGCCCTGC TACTGTTCGA TACCCGCCGG TGGCGAGGAA GGTACGTGGC CCAGGCCCTG
GTCTGCGTCG CGGGGTTCAT CTCCTTTACG GTGCTGCTGA GCTATCTGTA CGGGGTCGAT
TTCACTTCCG GTTACACGAG GGTCGCCGTC CCGGCGGCGG TCGCCACGCT GGTGCTGTCG
GTCGGAGCGA CGCTGGCGCG GTTGGGTTAT ACCCCGCTGG CGATCCTGGC GAGTTCCGGG
GCCGGCGGGG GCGTCGCCCG CCAGTTGCTG CCGGCCACGG TCATCGTCCC GGTCCTTGTC
GGGACGACCG CGGTGGCGCT GTGGCGGCTC GGGTTCTACG GCGTCGCGTT CCGGGGTGCC
CTGGTGGTCT CCTATACCGT GGTCATGCTG CTGGCGGTCA CCGTCGGGAT CTGCCGGCGG
CTGGATCATG CCGATGCCGA AAGACTGCGC GTCTCCGCCG ATCTGGCCGC GGCCAACGAG
CGGTTGGCGG GGACGAACGC CCGGTTGGCG GAGGCGAACA CGCGCCTCGT CGAGGCCAAC
AAGCGGATGC ACGAGGCCAT CGACGAGCTC GGCAGTTTCA CCTACAGCGT CTCCCACGAC
CTGCGGGCGC CACTGCGGTC GATGAGCGGT TTCTCCCGAA TCCTGATGGA CGAGTACGCC
GAGGACATGC CCGAGCAGGC CCGCGGCTAT CTCGACCGGG TGCAACGCAG CTCCGACCGG
ATGGGCGCGT TGATCGACGA CCTGCTCGCC TTCTCCCGCC TGGGCAGGCA GCCCATGAGT
AAGCGGGAGG TGGACCCGGC GGCGATCGTG TCCGCGGTGC TGGCGGAGCA AGAGGGCGAC
CGCGCCGGCC GCCCGGTGCG GATCACCGTC GGGGATCTTC CCCGTGGGGT GGCCGACCCG
GTGCTGCTCA AGGTGGTCTA CACCAACCTG CTGTCGAACG CGGTCAAGTT CACCCGGGGC
CGTGACCCCG CCCGCATCGA GATCGGCAGC CGCCGCGAGA ACGGGCGGGT GGTCTTCTAC
GTCGCCGACA ACGGGGCCGG TTTCGATCCG CGGTATGCGG ACAAGCTGTT CGGGGTGTTC
CAGCGGCTGC ACCGCTCGGA ACAGTTCGAG GGAACGGGTG TCGGTCTCGC GCTGTGCCAG
CGCGTCATCG CCCGCCATGG CGGGCGGATC TGGGCGGAGG CGGCGCAGGG GAAAGGCGCG
ACCTTCTTCT TCACTCTGAG CGAGGAGGTA TCGGCGTGA
 
Protein sequence
MNPEGRGPAK PVRVGAGSGT PAWPAGDGAH ASRLETVASW AAVFTIVMAA VTLLGWTLDN 
HVLTSIKPVW PSTKPNGAVC GLLLGFSLWS RLGRRRTAVR AAGQGAAALA LAISGLSLVE
SAVDRNFGID ELLFIDQATA SQSHPGRIVP FGAAILALLS LALLLFDTRR WRGRYVAQAL
VCVAGFISFT VLLSYLYGVD FTSGYTRVAV PAAVATLVLS VGATLARLGY TPLAILASSG
AGGGVARQLL PATVIVPVLV GTTAVALWRL GFYGVAFRGA LVVSYTVVML LAVTVGICRR
LDHADAERLR VSADLAAANE RLAGTNARLA EANTRLVEAN KRMHEAIDEL GSFTYSVSHD
LRAPLRSMSG FSRILMDEYA EDMPEQARGY LDRVQRSSDR MGALIDDLLA FSRLGRQPMS
KREVDPAAIV SAVLAEQEGD RAGRPVRITV GDLPRGVADP VLLKVVYTNL LSNAVKFTRG
RDPARIEIGS RRENGRVVFY VADNGAGFDP RYADKLFGVF QRLHRSEQFE GTGVGLALCQ
RVIARHGGRI WAEAAQGKGA TFFFTLSEEV SA
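
The relationship between the gene sequence and the protein sequence can be illustrated with a short translation routine. This is a minimal sketch, not the tool used to produce this record: it relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code (the tables differ in which start codons are permitted, and this gene starts with ATG). The names `CODON_TABLE` and `translate` are introduced here for illustration only.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering; "*" marks stop codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used in this record) is ignored.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("sequence length is not a multiple of 3")
    residues = []
    for i in range(0, len(seq), 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First 30 bases of the gene sequence listed in this record.
print(translate("ATGAACCCAG AGGGTCGCGG ACCGGCCAAG"))  # MNPEGRGPAK
```

The output matches the first ten residues of the protein sequence; passing the full gene sequence to `translate` applies the same procedure to the whole CDS.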