Gene Francci3_1486 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1486 
Symbol 
ID3903123 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1777591 
End bp1779330 
Gene Length1740 bp 
Protein Length579 aa 
Translation table11 
GC content72% 
IMG OID637878823 
Productputative signal transduction histidine kinase 
Protein accessionYP_480591 
Protein GI86740191 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0200919 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.602457 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAGCGGAC CCGTCCCCGA TGTCGATCGC GCGCGGCTCG AACAGCTTCT GATCCAGCTC 
TCCGAGCAGG TGAACGAAGT GGCGTCCGTG CAGACCCGGA TGCGTGGCCT ACTCGACGCG
GTCGTCGACG TGGCCCGGGA ACTGAGCCTG CCGGTCACCC TGCGCCGGAT CGCGCAGGCC
GCCTGCCAGC TCGTGGGGGC CAAGTACGGC GCCCTCGGGG TGATCGGCGA CGACGGGGAG
ATCTCGGAGC TTATCCACGT GGGCTTCCCT GACGAGGTGC CCACCATGAT CGGCAGGCTT
CCCGAGGGGC GCGGCCTGAT CGGGGAGAGT CTGCGCCACC CGCGGCCGTT GCGGGTGGCC
GACGTCGCCC GTCATCCCGT CGCGATCGGG TTTCCCAGCG GGCATCCGCG GTTCGACACC
TTCCTCAACG TCCCGATCAT GGTGCGGGGC GCGGCGTTCG GAACGCTGTT CCTCGGCGCG
AAGCGCGGCG GCGGCGAGTT CACCCAGGAG GACGAGGACC TGGCGTGCGC GTTGGCGGCC
GCGGTCGGTT TCGCGATCGA GAACGCGCGG CTGTACGAGG AGACCAGGCG CCGGCAGGCC
TGGCTCTCGG CCAGCGCGGA GATCACGACC GCGCTGCTGT CGGTTGCGGA GCCCGAGAAG
GCGCTCGACC TCGTGGCCCG GCGGGCCCGT CAGGCGACTG CGGCCCGGCT TGCCGCGATC
CTCGTCCCCG ACGAGGTCGG CCTGGTGGTC GGGGTTGTCG ACGGGGAGGG GGACGACGAC
CTGCGGGGCC GGGTCTTCGC CGACAACCGC CGCCTGAACG AGGCCATGCG CACCGGCCGG
GCCGTGCTCG TGCCCGTCGA CTCCCCGGGC GGTCCGCTGT TCGGCGCGGA CCATGCCGAC
CTCCCGGTGA AGGTCGGCAT GGTGGTGCCG CTGATGGCGG GTGGCCGGGC GCTGGGCATC
CTTGTCCTCG GCTCGGGTGG GCGTTCGGCC TCCTTCGGTG GTCTGGACCT GGAGATGGCC
GCGGCTTTCG CCGGGCAGGC CGCGCTCACC CTCGAACTGG CCCGGGTGCA CCGCGACCGG
GAGCGTCTGG CTGTCTTCGA GGAACGGGAC CGCATCGCTC GGGACCTGCA CGATGTCGTG
ATCCAGCGGT TGTTCGCCAC CGGTCTGCAC CTGCAGAGCC TCGCCCGGGC CGTGGACAGC
CGGGCGGCCG AGCGGCTGGA CGCCGCCGCC GGCGAGCTTG ACCAGACCAT CTCCGACATC
CGTCAGACGA TCTTCTCGTT GACCTCCAAC GCTGCTGAGG AGACCGATCT GCGCGCCGAG
ATCCAAGCGG TCATCGCCCA GGCCGAACGG GCCCTGGGGA TCACCCCGAC GGTCCGGCTG
GACGGACCGA TCGACCGCGG GGTCCCCGCG GCCATCCGCC CGCACATGCT CGCGGCGCTG
CGCGAGGCGT TGTCCAACAT CGCCCGCCAC GCGCGGGCCT CCCGGATCCA CGTGTTGCTG
CGGGTCACCG ACGCCGACCT CCTGGTCGAG GTGCGCGACG ACGGCCGCGG CCCTGGCAAG
GCCTCGCGCA GCAGCGGGCT GGCCAACCTG CGACACCGTG CGCTCGACCT CGGCGGCCGG
ATGGAGTTCG GCCCGGGAGC GGGCGGCATC GGCACCACCA TGACCTGGCA GGTTCCAGTG
ATCGCGCCGT TGCCCGAGCT TCGCGTGCAC TCCGGCGAGG GATGGGGCGT CCTCGGTTAG
 
Protein sequence
MSGPVPDVDR ARLEQLLIQL SEQVNEVASV QTRMRGLLDA VVDVARELSL PVTLRRIAQA 
ACQLVGAKYG ALGVIGDDGE ISELIHVGFP DEVPTMIGRL PEGRGLIGES LRHPRPLRVA
DVARHPVAIG FPSGHPRFDT FLNVPIMVRG AAFGTLFLGA KRGGGEFTQE DEDLACALAA
AVGFAIENAR LYEETRRRQA WLSASAEITT ALLSVAEPEK ALDLVARRAR QATAARLAAI
LVPDEVGLVV GVVDGEGDDD LRGRVFADNR RLNEAMRTGR AVLVPVDSPG GPLFGADHAD
LPVKVGMVVP LMAGGRALGI LVLGSGGRSA SFGGLDLEMA AAFAGQAALT LELARVHRDR
ERLAVFEERD RIARDLHDVV IQRLFATGLH LQSLARAVDS RAAERLDAAA GELDQTISDI
RQTIFSLTSN AAEETDLRAE IQAVIAQAER ALGITPTVRL DGPIDRGVPA AIRPHMLAAL
REALSNIARH ARASRIHVLL RVTDADLLVE VRDDGRGPGK ASRSSGLANL RHRALDLGGR
MEFGPGAGGI GTTMTWQVPV IAPLPELRVH SGEGWGVLG