Gene Francci3_0291 details


Gene Information

Locus tag: Francci3_0291
Symbol:
ID: 3903035
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 337183
End bp: 338814
Gene Length: 1632 bp
Protein Length: 543 aa
Translation table: 11
GC content: 70%
IMG OID: 637877620
Product: multi-sensor signal transduction histidine kinase
Protein accession: YP_479407
Protein GI: 86739007
COG category: [T] Signal transduction mechanisms
COG ID: [COG4251] Bacteriophytochrome (light-regulated signal transduction histidine kinase)
TIGRFAM ID:
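
The coordinate and length fields in this table can be cross-checked against each other. The sketch below is only an illustration, not part of the database record. It assumes that the start and end positions are 1-based and inclusive, and that the gene length includes the stop codon. The function names are hypothetical.

```python
# Values copied from the Gene Information table.
START_BP = 337183
END_BP = 338814
GENE_LENGTH_BP = 1632
PROTEIN_LENGTH_AA = 543


def span_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS whose length includes one stop codon (assumption)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: 338814 - 337183 + 1 = 1632 bp, and 1632 / 3 - 1 = 543 aa.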


Plasmid Coverage information

Num covering plasmid clones: 17
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.808463
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGCCGGAGC GCACCCACCC GAACGTCACC CACCCGAACG TCACCGACGG GGGAGCGGGC 
GACGCGGGGC CGAGCGGAGC GGTGGAGCGC GGCCGGGTCG GGTCCGTCTG GCGCCTGCGC
CGTCGACTCG AGGTGACCTA CGCGGTGGCG GCGATGGTGC TGATGATCGT CGCTGGTCTC
GTCGTCTCGT CGTTGGTCCG GCTTGATGAC GCCCTCCACA CGCGCAGCGA CGTCCTCGCC
CCGGCTCTGC TGAGCTCCAC CGAACTGGTC TCCAGCCTGG TCGACCAGGA GACCGGGATG
CGGGGCTACC TGCTTCAGGG CAGCGAGGAC TTCCTGGAGC CCTACAACGA AGGGCGCAGG
GCCGAGCGCA CGCTGCTCGC GGACCTGCAC CGACGGCTCG TGGAGCGCCC CGACCTGCTG
GCGAGGCTCG CCGTGCTGGA GGACCGGATC CGGGCCTGGC ACGGCGACTA CGCCGATCCG
GCCATCAACG CGGTGCGGGT GGGCGGTCCC GCGGCGGCCC GCCTGTTGTC ACCGGAGTTC
GGCAAGACCC GCTTCGAGGC GGTCCGCGCC GCGGCCGGGG ACCTCGAGGC GGACCTGCGG
GCCAGGGAAG TCCGGGCCCG CCAGCACGTC ACGGCGGCGC TCCGTTTCCT GATCTTCGCG
CTGGTGACCG GCGCGGCCGT CATGGCCGTA CTCCTGGCGG TCATCGCGCG GGCGCTACGG
TCCTGGGTCA CCCGCCCGCT CGAGCGGGTG AGCGACGATG CGCGAACCGT CGCCTCCGGA
CATCTCGATC ACGTGGTCGA GCCGACCGGT CCCCCGGACA TCGCCTCGCT CGCCGCCGAC
GTGGAGTCGA TGCGACGCCA GCTGATCGGG GAGCTCGGGG TCGCGCGGGC GGCGCGGTCC
GCGGTGGAGG CGCAGGCCGA GGCGCTGCGG CGCTCCAATC GTGACCTGGA ACAGTTCGCC
TATGTCGCCT CGCACGACCT GCAGGAGCCG CTGCGGAAGG TGGCGAGTTT CTGCCAGCTC
CTGGAGCGAC GCTACGGTGA CAGGCTCGAC GAGCGGGGCA GCCAGTACAT CGCTTTCGCG
GTGGACGGCG CGAAGCGGAT GCAGCAGCTC ATCAACGATC TGCTGGCCTT CTCCCGGGTG
GGGCGGACTA CTGACGGTTT CGTCGATGTG TCCCTGGGGG AGATCTTCGA CCGGGCGGTG
GGAGCGCTGT CCATCGCGAT CGAGAGCGCC CGGGCCGAGG TCACCGCGGA TGATCTGCCG
GTCGTGCGCG GGGACCCCGT CCTGCTCACC CAGCTGTTCG CGAACCTGAT CGGGAACGCG
CTGAAGTTCC GGGCCGAGAC GGTTCCCGCC GTGCATGTCG GTGTCGTCGA CCGGGCGGAC
GAGTGGGAGC TGTACTGCGC CGACAACGGG ATCGGAATCG ATCCGGAGTA CGCCGAGAAG
ATTTTCGTGA TCTTCCAGCG GCTGCACGGC CGGGATGTCT ACGAGGGCAC GGGCATCGGG
CTGGCGTTGT GTCGCAAGAT CGTCGAGTTC CACGGCGGCC GGATCTGGCT GGACGGCGCG
GTGAAGGACG GAACCACCTT CCGGTTCACT TTCCCTAACC GGAGACTTCT AACGGCCAGA
GACCCGCCGT GA
 
Protein sequence
MPERTHPNVT HPNVTDGGAG DAGPSGAVER GRVGSVWRLR RRLEVTYAVA AMVLMIVAGL 
VVSSLVRLDD ALHTRSDVLA PALLSSTELV SSLVDQETGM RGYLLQGSED FLEPYNEGRR
AERTLLADLH RRLVERPDLL ARLAVLEDRI RAWHGDYADP AINAVRVGGP AAARLLSPEF
GKTRFEAVRA AAGDLEADLR AREVRARQHV TAALRFLIFA LVTGAAVMAV LLAVIARALR
SWVTRPLERV SDDARTVASG HLDHVVEPTG PPDIASLAAD VESMRRQLIG ELGVARAARS
AVEAQAEALR RSNRDLEQFA YVASHDLQEP LRKVASFCQL LERRYGDRLD ERGSQYIAFA
VDGAKRMQQL INDLLAFSRV GRTTDGFVDV SLGEIFDRAV GALSIAIESA RAEVTADDLP
VVRGDPVLLT QLFANLIGNA LKFRAETVPA VHVGVVDRAD EWELYCADNG IGIDPEYAEK
IFVIFQRLHG RDVYEGTGIG LALCRKIVEF HGGRIWLDGA VKDGTTFRFT FPNRRLLTAR
DPP
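
The relationship between the two sequences above can be sketched in code. The example below is a minimal illustration and not the method the database used. It strips the blank-separated 10-character groups, translates codons with the amino acid assignments shared by NCBI translation tables 1 and 11, and computes a GC percentage. Table 11 differs from the standard table only in which codons may act as start codons; this gene begins with ATG, so no special handling of the first codon is attempted here. The function names are hypothetical.

```python
from itertools import product

# Codon order follows the NCBI convention: bases T, C, A, G with the
# third position varying fastest.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove spaces and line breaks from a grouped sequence listing."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    dna = clean(dna)
    residues = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(dna: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(dna)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGCCGGAGC GCACCCACCC GAACGTCACC"))  # MPERTHPNVT
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_percent` on the same input can be compared with the GC content field in the Gene Information table.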