Gene Francci3_1560 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1560 
Symbol 
ID3904792 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1870483 
End bp1872249 
Gene Length1767 bp 
Protein Length588 aa 
Translation table11 
GC content72% 
IMG OID637878897 
Productputative signal transduction histidine kinase 
Protein accessionYP_480665 
Protein GI86740265 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.298915 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones11 
Fosmid unclonability p-value0.578819 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAGTCCG AACGGGACCT CGCCTCCGCG GACGAAGGGA CCTCCGATCT GGCCGGCACC 
GCGCCGTCTC CGGGACCGCC GGGAGAGCTG ATCTTTCCCG CGGTCGCTCG GCTCGAGCTC
GACGAGCTTC TCGCCCAGCT GGTGGACCGG GCCCAGGACG TTCTGGCCAC TCAAAGCCGG
CTGCACGGCC TGCTGCGGGC CAACCGGATG GTCACCGCCG ACCTCAGTCT CGACGTGGTG
CTACGGCGCA TCGTCGAGTC CGCGTGCGAG CTGGTTGACG CCCGCTACGG CGCGCTCGGG
GTAATCTCCC GCGACGGTCG GCTCGAACAG TTCGTGCACG TCGGGATGGA CCCGGGACTC
GTCGAGACGA TAGGGCGGCT GCCCCGGGGA GACGGCGTGC TGGGCCTGCT CATCGCGGAA
CCGCGGCCGG TGCGCCTCGA CGACATCGCC GACCACCTCC AGGCGGTCGG TTTCCCCCCG
GGCCATCCAC CGATGCGGGC CTTCCTCGGG GTGCCGATCC GGGTCCGCAA CGAGGTCTTC
GGCAATCTCT ACCTCACTGA GAAACGCGGC GGTCGGGTGT TCACCGCGGA GGACGAGGAG
CTGGCGCTCG CGCTGGCCGC GAACGCCGGT GTGGCGATCG ACAACGCCCG GCTGTTCGGG
GAGGCCCAGC ATCGCCAGCA GTGGTCGCAG GCATCCGCCG ACATCACCCG GCATCTGCTC
GCAGACGGCG ACGACCCGCT GGAGCTGATC GTCCAGCGGG CCCGCGACGT CGCCCGCGCG
GACACGACGG CGCTGGCCCT CGGGTCGGAG ACGGCGACCG AGCTGTCCTT CGACGTCGCC
GCCGGGCAGG ACGCGGACCT GCTGCGGGGC CGACGGGTAC CGATAGAGAG CTCGCTGGCC
GGCCGCGCGG TATGCGACCA TGCGCCGCTG GTCGTCGCCG ACGTCCGTGA ACTGGCCGAA
TCCGAGGTCC ACGACCTCGA CATCGGGCCA ACGATGATCG TTCCGCTGAT CACGTCACAT
GTCACGTCCG GGGCGCTGAT CCTCGCCCGG CGCCGGGGCG GTGACCTGTT CACAGACGCG
GACCTGGAGC TGGCGGCCGC GTTCGCCGCG CACGTCGCGC TCGCGCTGGA ACTCGCCCGC
TCCCGGGCCG CACGCGGCCG GCTGTCCCTG CTGGAGGACC GCGACCGGAT CGCCCGCGAC
CTGCACGACC ACGTGATGCA ACGCCTGTTC GCGGTGGCGA TGGGCCTGCA GGGCCTCGCG
GCCAGCGAGG AGCGGCCCAC CCGGGCGGAA CGGATGAACA CCTACGTGGA GGATCTCGAC
GAGACGGTAC GGGAGATCCG CCACACCATC TTCGAGCTGC GGGGCCGGGC GGCACCGGCG
ACGGGCAGCG GCCTGCGGGC GCAGATCCTG GGCGTCATCG ACGACATGAC CGGCGCGTTC
GGCTTCACCC CGCGGGTCCG GCTGGACGGA CCGGTGGACA CGATGATCGG CTCTACCGTC
GGCGACCATC TGATCGCGGT CGTGCGCGAG GCGCTGTCCA ACGCCGCCCG GCATGCCAAG
GCGAGCAGTG TCGAGGTGAC CGTGACCGCC GCGAGCGGCG CCGTCACCGT CGACGTGGTG
GACGACGGGG TCGGCATCGG CCCTACCACC CGCCGCAGCG GGCTGGCGAA CATGCGGGCC
CGGGCGCGGG ATCTCGGCGG GACCTTCGAA CTCGGCCCGG GCCCCGACGG CGGCACCCAG
CTGCGCTGGT CGGTGCCGAC TAGCTGA
 
Protein sequence
MESERDLASA DEGTSDLAGT APSPGPPGEL IFPAVARLEL DELLAQLVDR AQDVLATQSR 
LHGLLRANRM VTADLSLDVV LRRIVESACE LVDARYGALG VISRDGRLEQ FVHVGMDPGL
VETIGRLPRG DGVLGLLIAE PRPVRLDDIA DHLQAVGFPP GHPPMRAFLG VPIRVRNEVF
GNLYLTEKRG GRVFTAEDEE LALALAANAG VAIDNARLFG EAQHRQQWSQ ASADITRHLL
ADGDDPLELI VQRARDVARA DTTALALGSE TATELSFDVA AGQDADLLRG RRVPIESSLA
GRAVCDHAPL VVADVRELAE SEVHDLDIGP TMIVPLITSH VTSGALILAR RRGGDLFTDA
DLELAAAFAA HVALALELAR SRAARGRLSL LEDRDRIARD LHDHVMQRLF AVAMGLQGLA
ASEERPTRAE RMNTYVEDLD ETVREIRHTI FELRGRAAPA TGSGLRAQIL GVIDDMTGAF
GFTPRVRLDG PVDTMIGSTV GDHLIAVVRE ALSNAARHAK ASSVEVTVTA ASGAVTVDVV
DDGVGIGPTT RRSGLANMRA RARDLGGTFE LGPGPDGGTQ LRWSVPTS