Gene Francci3_3251 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3251 
Symbol 
ID3904422 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3848815 
End bp3850593 
Gene Length1779 bp 
Protein Length592 aa 
Translation table11 
GC content69% 
IMG OID637880576 
Productputative signal transduction histidine kinase 
Protein accessionYP_482337 
Protein GI86741937 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones18 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.956838 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAACTG GGTCCGCAGC CAGCCAGAAG CGGGCGGGAA CGGACCCTTC GTCGGTCTCT 
TCGTCGGTCT CTTCGACGGG CTCACCGACG AGCGAGAACC TGACTTTCCC GCTCGTCGCG
CGGCTCGAAC TCGATGAACT GCTGACGCAG CTGGTTGACC GGGCCCAGGA TGTGCTGGCC
ACGCAGAGCC GGTTGCGCGG GCTGCTGGCG GCGAGTCGGG CCATCGCCAC GGACCTGCGG
CTGCCGGTGC TGCTCAGGCA CATCGTGGAG GCGGCGTGCG AGCTGCTGGA CGTCCGGTAC
GGGGCGCTGG GCGTCGTCGC GCCGGACCGC ACCCTGGAGG AGTTCGTCCA CGCCGGCATG
GACCCGCGGG ACGTCGACCG AATCGGTCAT CTGCCCACCG GCCACGGGGT GCTCGGTCTA
CTGATCGATG ATCCTCGGCC GCGTCGCCTC GACGACATCT CCCGTGACCC GAGTGCGTAT
GGCTTCCCCC CCGGCCATCC GCCGATGAGG ACCTTCCTCG GCGTACCTAT CACCGTGCGC
GGCGAGGTGT TCGGCAACCT CTATCTGACC GAGAAACGCG GCGGCACGGC CTTTACCGCC
GAGGACGAGG AACTCGCGCT CGCCTTGGCC GCCAGCGCCG GTGTCGCGAT CGAGAACGCC
CGGCTGTTCC ACGACACCCA GCAGCGGCAC CGGTGGATGA GCGCGTCCGC CGACGTGACC
CGCCAGATCA TGGCCGATGC CGACGGCGCC CTCGAGTCTG TTGCGGAACG CGTCCGCACG
GTCGCCGACG CGGAGTTCGT CTCCATCGTC CTGCGCGACG AGGAGGCCGG CAGCGCGCGG
GTCGCCGTCG CCGTCACCGT CGGAGCGGAC ACCGTTTCCG GCGCCGGTCA GCGGATACCT
CTCGACGGCA CCCTCACCGG TCGGGTGATG GCCGAGCAGC AGCCGTTGCG CGTGGAGGAC
GCGCAGCTTG ACGCCCTCCC CGAGGAACGG GACGCCGCCA CCGGCCCCCT GATCGTCCTC
CCGATGGTCG CCGGCACCGA CCACGTGTCC GGAGTGCTGC TTGTCGGTCG AAACCGGGGG
AAGCGTCCCT TCAGCGACAC CGATCTCGCC AGTGCCGCGA GCTTCGCGGG CAACGTCGCG
ATCACTCTGG AACTGGCGCG GGTCAAGGCC GACCGGGAGC GGTTGGTGGT CCTGGCCGAC
CGTGGTCGCA TCGCCCGTGA CCTGCATGAC CACGTCATCC AGCGGATGTT CGCCGTTGCG
CTCGGCTTGC AGGACATCGC CCAGTACGAA CGTCCATCGA ACGCCGAGCG CATCAACCAG
TACGTCGAGG ACCTTGACGT TACGATCAAG GACATTCGGC GGTCGATCTT CGAGCTGCGT
GCCAGCGGCG CGTCCAAGCG AAGCCGACTG CACGCCGCGA TCGACCGGAT CGCCGAGGAC
GTGCGTCCGG CCCTCGGCTT CACGCCGACC GTCCGCTACT CCGGACCGCT CGAAACCGTG
ATCGGTGACG ATCTCGCCGA CCAGGTGATC GCTGTCGCCC GGGAGTCGTT GACCAATGCG
GCGCGGCATT CGCAGGCCGG GACCGTCGAG CTCCGTATCG GCGTGGCCGG CGAGTCGGTG
GTCGTCGACG TGATCGACGA CGGTGTCGGC ATAGGTCCCG GCGGCCGGCG CAGCGGGTTG
GACAACCTCC GGACCCGGGC CGAGCAGCTC GGCGGATCTT TCACCCTCAC CACACCGGCA
GGCGGGGGCA CCCACCTGCA CTGGGCCGCG CCCCTGTGA
 
Protein sequence
METGSAASQK RAGTDPSSVS SSVSSTGSPT SENLTFPLVA RLELDELLTQ LVDRAQDVLA 
TQSRLRGLLA ASRAIATDLR LPVLLRHIVE AACELLDVRY GALGVVAPDR TLEEFVHAGM
DPRDVDRIGH LPTGHGVLGL LIDDPRPRRL DDISRDPSAY GFPPGHPPMR TFLGVPITVR
GEVFGNLYLT EKRGGTAFTA EDEELALALA ASAGVAIENA RLFHDTQQRH RWMSASADVT
RQIMADADGA LESVAERVRT VADAEFVSIV LRDEEAGSAR VAVAVTVGAD TVSGAGQRIP
LDGTLTGRVM AEQQPLRVED AQLDALPEER DAATGPLIVL PMVAGTDHVS GVLLVGRNRG
KRPFSDTDLA SAASFAGNVA ITLELARVKA DRERLVVLAD RGRIARDLHD HVIQRMFAVA
LGLQDIAQYE RPSNAERINQ YVEDLDVTIK DIRRSIFELR ASGASKRSRL HAAIDRIAED
VRPALGFTPT VRYSGPLETV IGDDLADQVI AVARESLTNA ARHSQAGTVE LRIGVAGESV
VVDVIDDGVG IGPGGRRSGL DNLRTRAEQL GGSFTLTTPA GGGTHLHWAA PL