Gene Francci3_3447 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3447 
Symbol 
ID3905687 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4099095 
End bp4101779 
Gene Length2685 bp 
Protein Length894 aa 
Translation table11 
GC content73% 
IMG OID637880770 
Productputative signal transduction histidine kinase 
Protein accessionYP_482530 
Protein GI86742130 
COG category[T] Signal transduction mechanisms 
COG ID[COG4585] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTGAGGG GGCTGAGCCC TCATGATGAT GAGGTGCGAA GTCGGCCCCC ACCCTGGTCC 
GGGCGACCTC CTCGGGCCGG GCAGGGCGTG ACCGCCGACG GGCGGGAGGC GGCCTCGTCC
GGGGTGCGCG AGGCGGTCCC GTCCGTGGCG CCGGCCACCG TGCCCACGGT CGCCTCCGCC
TCCGCCTCCG CCTCCGCGGG CACGGTGACC GCCTTCGGGG AGGGCGCGCT CGAGCTGGGC
CGAGCCTCCG TCGAGCTGGG CCGGGCTTAC CAGGCGCTGG AACAGGTCAT GGGGCGCGTC
CTCGTCACCC TGCGGGCGGT CGCGGTTCTG CTGACGGTGT GCTACGTCGC GGTCTGGCGC
TGGTACGACC ACAGGCCGGC GGCGATGGCC GTCGCGGGGC TGACGGCCAT CGGCGGTCTC
GTCTTCTGCG TCGTCGGCCT GCGACGCGGG ATCCGTCGAT CGATGACCAC CGTCACGGTC
GCGATGTCGG CGGTCCTCGC CCTGACGGCC TCGTCCTGGT TGCCGCCCGC TTCGGTGGGG
GACAGCGGGA ACTTCGTCTT CCTGACCGCC ATCAACGCCG GGATCGTGAC GGTCTGGACA
TTTCCGACGG CCGTCGCGTC GGTCCTGCTC CTGACGCTGT GCGGCGCCAC TCTCGTAGGC
GGCTGGGGCC ACAATCCCCA GGTCTTCTCT CAGACCGCCA TGCTGCTGAT CATTCCCGGT
CTACTCGGGC TGGCGATCGG TCGACTGCGC CGGATCGCCC GCACGGCCGA CCAGCGCTGG
GCGAACGTCG CGGCCCGGCA CCGCAGCGAG GCGGTCGTCC TCGCCGTCGC CCGCGACCGC
CGGGAACGGG AGCGAGTCAT CCACGACACG GTGCTGAACA CCCTGACCGG CATCGCCTGG
GGCGGGGGAC GGGACGTGGA ACTCACCCGG CGTCGATGTG CGCAGAGCCT CACCGCCGTG
CGCGGCCTGC TCGACCGGGA CGACGAGGTC GGTCCGCCGA TCGGCGAGCG GCTGGCCGAG
GTCGTCCGCG ACGCCACTCG ACGGGGGCTG CAGGTCAGCC TCGACGACCA GCGTCTCCCA
GCCATCGCGG GGGAACCTCC CGGTGAGCCC CCGGCGGTGG TCGTGGAGGC CTTCGTCGGA
GCGGTCGGCG AGGCGCTGGT CAACGTGGAA CGGCACGCCG GCACCCGGCG TGCCTCGCTG
CTCCTCGGCG GTGGCCCGGG CCTGCTCATC GTGACGGTGT CCGACGCCGG GTGCGGCTTC
GACCCGCGGC GGGTCGACTC CGCCCGGCTC GGGCTGCGCG AGTCCATCGT CGGACGGCTG
GTGGATGTTG CCGGGACCGC CCGCATCGAC GCGCGCCCGG GGCAGGGCAC CGTTGTGGAA
CTGCAGTGGC GCGCCCCGGA CGCGACGACG GGGCACGTGA CGCGACCACA TCCAAGCGAT
GATCGCGATG ATCGTGGCGA TCGGGATGAT CACGGAGCCG GAAGGGCCGG CGGGCGGGTC
GCCGATCGTC GGGACGTCGC CGCGATCGCC GCCGAGCTAC GCGACGCCTA TGCCGCCGGG
CTGCGCCGGG CGCTCGGGCA GATCGCCGGG CTGTGGCTGA TCATCATGCT CGTGCCGTTG
GTCGGCACGG GTGGGTGGGT GCGGACGATG GCCGGAGCGG GAGCGCTGTG GCTGCTGGTG
GCCGTGCTCA CCGTCGTCTT CGTCCGGCGC GGACGGAGTC GGCCGATCAG CGGTCCCGAG
GCGGCCGTGC TGCTGTTCCT GGCCGTCGCC GTCGCCGTCG CCGGCATAGC GAACACCGTC
GGCGCGGACA TCGTCCGGAT TGCGGACTGG CCGTTGCTCG TGCTTCCCCT GCTCCTAGCG
TTCATCACCG CCTCCCGGCC ACTCTGGGAA TGGGTGGGTG CGCTGCTGGT CGCGATTGCC
ATGATGATCG TTGCGGTGTT CCTGCGCGGT AGCACCGAAT CGCTGGTGCT GGCGCGGCTG
GGGGTACTGA TCTACGGTGC GTGCGGGGTG CAGATCGTGA CCGCGATGCT GGGTCCCCTG
CTGCGCGGCA CCGCGGAGAC GACCGCCCGC GCCTTGGCCG CCGAGGCTGA GGTCGCCGCC
CAGGTGGATG CGTCCACCAT GATCAGGTGG GAGCGGGCGC AGTGGTTGCG CACCGTGGGG
TGGGAGGTGC TGCCGTTACT CGACGGGGTC GCGGGCGGTT GGTTGGACCC GCGGGAAGCC
GCCGTTCGGA GCAGGTGCGC GATGCGGGCG GCCGCTGTCC GTCGCATGAT CACCGGGGGT
GGTCCGTCGT CGGCGCTGGC GGACCTCGAC GTGGTCGTCG CCGACGCCGA GGCCGCGGGA
ATGACCGTCC AGATCCAGCT GTCCGGCGAT CTGCGGCTCG CGCCAGCTCC TGTGCGGGCC
ATGGTGGCCG ATCAGGTGCG GGAGGTGCTC GCCGCCGTCC CCGGCGGCCG GGCGATTGTG
ACGGTCCTGT GGACGCCGGC CGGCGGCAGT GTTTTCGTCT CGCTGCCCTG GCCGCAGGGC
CTGCCGCCCC CGCAGCTCGG CCGCGGGGGA GAGGACGCGG GCGGGATCGA GGTCGCGGTG
GAGCTGGACG ATCGGTGTCT CAGCCTGGAG CTGACCTGGC CGGCGGCGAG CGCGGTCACG
GACCCGACCG GGCGGGTCGA GTCGATCGGT GGGCTGGCGG AGTAG
 
Protein sequence
MLRGLSPHDD EVRSRPPPWS GRPPRAGQGV TADGREAASS GVREAVPSVA PATVPTVASA 
SASASAGTVT AFGEGALELG RASVELGRAY QALEQVMGRV LVTLRAVAVL LTVCYVAVWR
WYDHRPAAMA VAGLTAIGGL VFCVVGLRRG IRRSMTTVTV AMSAVLALTA SSWLPPASVG
DSGNFVFLTA INAGIVTVWT FPTAVASVLL LTLCGATLVG GWGHNPQVFS QTAMLLIIPG
LLGLAIGRLR RIARTADQRW ANVAARHRSE AVVLAVARDR RERERVIHDT VLNTLTGIAW
GGGRDVELTR RRCAQSLTAV RGLLDRDDEV GPPIGERLAE VVRDATRRGL QVSLDDQRLP
AIAGEPPGEP PAVVVEAFVG AVGEALVNVE RHAGTRRASL LLGGGPGLLI VTVSDAGCGF
DPRRVDSARL GLRESIVGRL VDVAGTARID ARPGQGTVVE LQWRAPDATT GHVTRPHPSD
DRDDRGDRDD HGAGRAGGRV ADRRDVAAIA AELRDAYAAG LRRALGQIAG LWLIIMLVPL
VGTGGWVRTM AGAGALWLLV AVLTVVFVRR GRSRPISGPE AAVLLFLAVA VAVAGIANTV
GADIVRIADW PLLVLPLLLA FITASRPLWE WVGALLVAIA MMIVAVFLRG STESLVLARL
GVLIYGACGV QIVTAMLGPL LRGTAETTAR ALAAEAEVAA QVDASTMIRW ERAQWLRTVG
WEVLPLLDGV AGGWLDPREA AVRSRCAMRA AAVRRMITGG GPSSALADLD VVVADAEAAG
MTVQIQLSGD LRLAPAPVRA MVADQVREVL AAVPGGRAIV TVLWTPAGGS VFVSLPWPQG
LPPPQLGRGG EDAGGIEVAV ELDDRCLSLE LTWPAASAVT DPTGRVESIG GLAE