Gene Francci3_4047 details

Gene Information

Locus tag: Francci3_4047
Symbol:
ID: 3907008
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4831006
End bp: 4833756
Gene Length: 2751 bp
Protein Length: 916 aa
Translation table: 11
GC content: 70%
IMG OID: 637881376
Product: WD-40 repeat-containing serine/threonine protein kinase
Protein accession: YP_483126
Protein GI: 86742726
COG category: [K] Transcription; [L] Replication, recombination and repair; [R] General function prediction only; [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase; [COG2319] FOG: WD40 repeat
TIGRFAM ID:
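
The gene and protein lengths in the record follow from the start and end coordinates. The sketch below reproduces that arithmetic; the function names are hypothetical, and it assumes inclusive 1-based coordinates and a single untranslated stop codon at the end of the CDS.

```python
def cds_length(start_bp: int, end_bp: int) -> int:
    """Length in bp, assuming inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_bp: int) -> int:
    """Residue count, assuming one terminal stop codon that is not translated."""
    return cds_bp // 3 - 1


if __name__ == "__main__":
    # Coordinates copied from the record above.
    length = cds_length(4831006, 4833756)
    print(length, protein_length(length))  # prints: 2751 916
```

Both values agree with the Gene Length and Protein Length fields of the record.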


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCGATAT CCCGGCGCAA CCTCCCTGGG CTCACCGGGA CTGATCTCCA GACTATCGGT 
CCGTATACAG TTGAACGGAA GCTGGTCGAC GCCAGGACAG GACCAGTGTT TCTTGCACGT
AACGGCGAGG CGCGTCCGGT CCTCGTCAAG ACGATTACTG CGCCCTTCGG GCGGGACGCA
GAGTTCCGTC GGCGCCTGCG CGTCGACCTC GACAACATCC GCCGGTTGGC GCCGTCCTGC
CTCGCCGCCA TCCTCGACCT CGACACGGGC GCTCGCCCCC CATACGTCGT CGCGGAGTTC
ATCGACGCTC CGACTCTCGC CGCAACCGTC GCCGGCGGCG CAGCGTTGTC CGGACCAGAC
ACCTACCGGC TCGCCGTCGG GCTGGCGACG GCTCTGGCCG CACTGCACGA ATTGGAAATT
TTCCTCGGCG ACCTGAAGCC TATCAATGTG GTTCTTTCCG GGCAGGGAGT GCGGCTGGTT
GACTTCGGGC TCTTCCGAGC GATGAATGCG GTAAGCATCA ATAATCCGGG CGGACCGCCG
TCCGGTATCG GAACACTTGC GTTCATAACG CCCGAACAGG CTCTCGGGCA GACCGCCACC
GTCGCATCGG ATGTCTTTAC CTGGGGCGGC ATGCTGTTGT TCGCCGCGAC CGGCCGGCCG
CCATTCGGTG CCGGAACGCC CCGGGTCCTG CTGCAGCGGG CCGTCTACGC GGAACCCGAC
CTGTCCGTGT TCTGCCCGGA GCTGCGAGAG CTGGTCGCGG CGGCGATGCG CAAGGACCCG
AAGCGCCGGC CGGCCGCGGC CGAACTGCTC GAACAGCTGA TGGCCTATCC GACACGAAGC
GAGGCGGAGC CGGCGGTCGA ACCCACCCGG CGTCTCGCCC TGCCGGCGGG TGTCATCGAG
ACGCTGGTGC CGGTGCAGAC GAGGCGGACG GTCGAGTCCG AGACGAAACC GGAGGTCGTA
GGCACGAGCG ACCTCGCCGC TCCTGCCATC GGTGCCATCC ACGGCCTCGA GATCGTGCTC
GAAACCGTGA CCGTACTCGA AACCGTGACC GTGCTCGGGA CCGGGCCGGC CGCGGAGATC
ACGGCCGTCC CGCCGGCCGC GGAGATCACG GCCGTCCCGC CGGCCGCCGC GGTCGCCCTC
GGCGCCGTCC GTACCGGAGC AGAACCGCGG ATGCGTCCGT CGTCTCCCGG TCCGTCGTCT
CCCGCTCCGT CGTCTCCCGG TCCGTCGGCC GCTCTGGCCG GTGTGCCCGC CCCACTGCCC
ACCGCGGCCG CGAGAGCGCG CGCAAGCTAC GTCCAGGAGG GCACCGGGAA TGTCTCCGCC
TTGTCATCAC CATCGTCACC GCGGGATCAC CTGGACCGCG GGTGGCTTCG ACGCGTGCTC
TCGATCGGCG TGGCGGTCTC GGTGCTCGTC CTCTCGACCG TTGTGATCGT CGATACGCTG
CGGGAGCGCA GCGCGGCGGC CACGTCGCGT GAGGCGGCCC GTGGCGCGAT GCGCCTTCTC
GACCGGGAAC CGGATCTCGC CGGCCAACTC GCCGTCTCGG CCTACCGCAT GGCTCCGACC
TCGGCTGCGG CGGAGGCGCT GGTCAATGCG AGCATCCGGC AGATCGGGCC GGCGACGGGA
GCGATCCGCG ATCTGCTGAT CACCCCGGAT GGTCGATATC TGATCATCGT CGGTGACTTC
GGCGGGTCGG TGTGGAACAT CATCGGCCCT GGCCGGGTGC GCTACATCAC CGACCTGCCC
GCGGTGGCCG CGGCCATGGG AGATCGCAGC CCCCTCGCGG CCGGGGTCGC CCCGGCGGCC
GCGGTGAAGG GCCTGCTCGC CGTCGCGCTG ATACCCGCAT CCGGTCCGGC CACGGGTGTG
CGGGCGTCCA CGATCATGGT GACGGCGGGG ACGGACGGGG TGATCCGGTT ATGGCGGCTG
GCCCAGCCCG GGTCCACGGA CGACGGTGAC CTTCAGTCGG CGATCAACAC CGGTCAACGG
GTGAGTCTGC TCGCCGAGTT GCGGGGACAC ACGGGTGCGG TGGCGAGCGT CGCGGTGAGC
GGGGATGGTC GAACCCTGGC CTCGGCGGGC GCCGATCGCG TCGTCCGTCT CTGGGACATC
GGCCATCCGC AGAATCCTCG CGCCCTCGCC GAGCTGCCCC AGCCGGCGGA GGTCACCAGC
CTCGCCTTCA CCCCCGACGG CGACTCGCTC GCGGTCGGGG GGGTAGGCCA TCTCTCCGTC
TGGGATGTGA CCGCTGCCGG GCAACCGCGC CGCCGGGCCC AGCTGACCGC CCCCGCCACT
GTCCGCAAGC TCCTCGTCAG CCCGGACGGC CGGTGGCTCG CCGTGGCGAG TACCTCCGAC
GGCGGCTCGC TGACGGAGAT CTATGGGCTG GACAGCCCCC GGGGACTGCA CCGCCTCACC
GCCATCGCGA GCCGGCCAGG CCAGGCCGGT TCGATCGCGC TCTCCGCCGA CGGGCGGGTC
CTCGCTGTCA GTACCCCGGC CGGTCAGGTA ACGCTCTGGG ACATGCGCTC ACCATCCCGG
CCGGTGCAGC GGGCCACCCT GCCGGTCGGC ACCGCGCCGA CGGCGACGGT CTTCGGACCT
CTGGGACATG AGGGCGTCCT CGCAGTCGTA GCCGGTGACG CCGTCCGTCT CTGGCAGCTC
GACCTGCTCG CGGCCGAGGA CGAGATCTGT GCCAGGGCCG AGGGCCGCAT CAATCGGGAG
CAGTGGCGGA CCTACCTCGG TCACCGGCAC TACGACCCGC CCTGTGACTG A
 
Protein sequence
MAISRRNLPG LTGTDLQTIG PYTVERKLVD ARTGPVFLAR NGEARPVLVK TITAPFGRDA 
EFRRRLRVDL DNIRRLAPSC LAAILDLDTG ARPPYVVAEF IDAPTLAATV AGGAALSGPD
TYRLAVGLAT ALAALHELEI FLGDLKPINV VLSGQGVRLV DFGLFRAMNA VSINNPGGPP
SGIGTLAFIT PEQALGQTAT VASDVFTWGG MLLFAATGRP PFGAGTPRVL LQRAVYAEPD
LSVFCPELRE LVAAAMRKDP KRRPAAAELL EQLMAYPTRS EAEPAVEPTR RLALPAGVIE
TLVPVQTRRT VESETKPEVV GTSDLAAPAI GAIHGLEIVL ETVTVLETVT VLGTGPAAEI
TAVPPAAEIT AVPPAAAVAL GAVRTGAEPR MRPSSPGPSS PAPSSPGPSA ALAGVPAPLP
TAAARARASY VQEGTGNVSA LSSPSSPRDH LDRGWLRRVL SIGVAVSVLV LSTVVIVDTL
RERSAAATSR EAARGAMRLL DREPDLAGQL AVSAYRMAPT SAAAEALVNA SIRQIGPATG
AIRDLLITPD GRYLIIVGDF GGSVWNIIGP GRVRYITDLP AVAAAMGDRS PLAAGVAPAA
AVKGLLAVAL IPASGPATGV RASTIMVTAG TDGVIRLWRL AQPGSTDDGD LQSAINTGQR
VSLLAELRGH TGAVASVAVS GDGRTLASAG ADRVVRLWDI GHPQNPRALA ELPQPAEVTS
LAFTPDGDSL AVGGVGHLSV WDVTAAGQPR RRAQLTAPAT VRKLLVSPDG RWLAVASTSD
GGSLTEIYGL DSPRGLHRLT AIASRPGQAG SIALSADGRV LAVSTPAGQV TLWDMRSPSR
PVQRATLPVG TAPTATVFGP LGHEGVLAVV AGDAVRLWQL DLLAAEDEIC ARAEGRINRE
QWRTYLGHRH YDPPCD
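
The protein sequence above can be rederived from the gene sequence, and the GC content checked, with a few lines of code. This is a minimal sketch using only the Python standard library, not the tool that produced this record. The helper names (`clean`, `gc_content`, `translate`) are hypothetical. It assumes the codon assignments of translation table 11, which match the standard genetic code for internal codons, and it does not handle alternative start codons or ambiguous bases.

```python
from itertools import product

# Codon table in the conventional TCAG order; '*' marks a stop codon.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of 10."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate from the first base and stop at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


if __name__ == "__main__":
    # First 30 bases of the gene sequence above.
    print(translate("ATGGCGATAT CCCGGCGCAA CCTCCCTGGG"))  # prints: MAISRRNLPG
```

Passing the full gene sequence to `translate` and `gc_content` allows a comparison against the protein sequence and the GC content field listed in the record.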