Gene Franean1_4090 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4090 
Symbol 
ID5672448 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp4873850 
End bp4876120 
Gene Length2271 bp 
Protein Length756 aa 
Translation table11 
GC content76% 
IMG OID641242966 
Productserine/threonine protein kinase 
Protein accessionYP_001508383 
Protein GI158315875 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.252638 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones17 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCCCGCCG TCGACCGCGC GCGGATCGCC CACGCGCTGC CCCGCTACAC CCTCGGCGAC 
CAGCTCGGCT CGGGCTCGTT CGGCCTGGTC ATCGCCGGGC ACCACCAGGA TCTCGACCGC
CCTGTCGCGA TCAAGATCCT GTCGGCGGCG CTCACGGACG ACTTCCGGGC CGAGGCGCGG
ATGCTCAGCC GCCTCGACCA CCCGCACATC GTCCGCATCT ACGACTACGT CGCCCAGGAC
GACCTGTGCC TGCTGATCAT GGAGAAGTTG GGGGGCGGCA GCCTCGCCCA GCACCGCCTG
CGGGCCGAGG CGGCCCTCGC CGTCGGGGTG GCCGTCGCCG ACGCGCTCGC CCACGCGCAC
GCCCACGGCG TGCTGCACCG CGACATCAAA CCGGACAACA TCCTGTTCAC CGACGCCGGG
CAGCCCAAGC TCACCGACTT CGGCATCGGG AAGATCGTCG AGGGTGGGGC GGGTGCGGTC
AGCCGCGCCG TCGGCACCCC GAAGTACATG GCGCCCGAGC AGATCACCGG TGCGCCGGTC
GGGCCGCCCG CGGACCTCTA CGCGCTCGGC GCCGTCCTCT ACGAGCTGGC CGCGGGCCGG
CCCATGTTCG ACCCCGGGCT GGGGGTCCCG CAGCTGCTGC GCCATCAGTG CGAGGTCGAC
CCGCCCGTGC CGCCCGGCGT GCCGCCGGTG GTGAGCGACA TCATCCTGCG TGCGCTGGCC
AAGGATCCCG CCGCCCGCCA GCCCGGCGCG CACGAGCTGG CCCGCGCCCT GGCGGAGGCG
GCCCTCGGCC TGTTCGGCCC GGACTGGCTG GCGCGCAGCG GGCTGATCGT CCGGCGCACC
GAGACCACGG ACGCACCGGA TCCGCCCGGT GGCACCGGCA CCGTCCGCCG CTTCACCCCG
GCCGCGGGAC GGTCCGTCCC GCCCGGACCC TCCGGTCCGG TGAGCCCACC CGGCGAACCC
GGCCCGCCGG GCTCACCGGG GGCGGCTGAC CTGGCGGTGA CGTCCGGTCC GGTGGCGGTT
CCGGTCGCGC CGGGTGGGCC CGGTCCGGGG GGCTCCGCGG ATTCCGGCGG CGCGTCCGGG
CCGCCGGGCC GGAACCTGTT CGAGGGCACC CCGTACGGCG CGCGCCGGCG TACCGCCGGG
TGGGGCGGGC GGCGCGTCAC CGCCGCCGTG GCCGCCGGGG TGGCCGCGGT CGCCCTGGCC
GTCACCGGAG TGGTCGTGCT CGTCGGCGGC GGTGCGGGCT CCGACGACCA GGTCGCCGTC
CCCGTCACCC CGACCGCCCC GCCGCGGACT CCCGCGCCCG CGACGATCGA CAGCATCGCC
GCCTGGGCCG CCGCGCCGTC CGGCGGCTTC TACGTCGTCG AGGACGGCGG CACCCGGCTG
TTGCGGCTGG GTTTGGACGG GAAGGTGTCC GTCGTCGCGG GCACCGGTGC GCAGGGGTCG
GACGGTGACG GTGGTCCCGC CGTCCAGGCA CGCCTGCGTG GGCTCGACGC CGTCGCGGTC
GACGGCGCCG GCACCGTCTA TCTCGGCGAG GACTCGAACG GCAAGATCAG GAGGATCGGC
CGCGACGGGG TGATCACGAC GGTGGTGGGT GGCGGTACGC GCCTCGCCCG GGAGGGCGAG
CCGGCAACCG AGGTCTCGCT CTCCCTCATC TCAGGCCCGA TAGCCGTCGA CACGGACGGC
GACCTCTACC TCTACGGCGC GGGCCGGATC TACCGGGTCG ACCGCGACGG CATCCTGCAC
GTCGTCGCCC GCACGCAGGA GCCGGGATCC ACCGAGGCGC CCCCGGAGGG CGAGGAGGAC
GTGCCCGTCG TCGCGGCCAA CATCGGTGAC CTGGAGGTAC GCGACGGACA GGTCTACATC
GCCGACTACT CCGCCGACCG GATCCAGGTG CTCGGCCCGG ACGGCGCCGT GCGCACCCTG
GCCGGCGGCG GCCGGGAGGG CGACAGCGGG GACGGCGGCC CGGCGACCGC CGCCGCCCTG
AGCCTGTCCG TCGAGGCCTC TGTCCTCGCG TTCGACCCGG CGGGCGGTCT GTACGTCGTC
GAGTCCCTCG GCAACCGGGT GCGGCGGGTC GACCCCGGCG GGACCATCAC CACCGTCGCC
GGCACCGGCG AGCTCGGCTC GGACGGCGAC GGCGGCCCCG CCGCGAAGGC CCGGCTCTGG
TCTCCCCGCC GCATCACGGT CGACGCCGCG GGCGTCCTGT ACGTGAACGA GACCGGCACG
ACCATCCGAC GGGTCGGCCT CGACGGGATC ATCACCACGA TTCACGAGTG A
 
Protein sequence
MPAVDRARIA HALPRYTLGD QLGSGSFGLV IAGHHQDLDR PVAIKILSAA LTDDFRAEAR 
MLSRLDHPHI VRIYDYVAQD DLCLLIMEKL GGGSLAQHRL RAEAALAVGV AVADALAHAH
AHGVLHRDIK PDNILFTDAG QPKLTDFGIG KIVEGGAGAV SRAVGTPKYM APEQITGAPV
GPPADLYALG AVLYELAAGR PMFDPGLGVP QLLRHQCEVD PPVPPGVPPV VSDIILRALA
KDPAARQPGA HELARALAEA ALGLFGPDWL ARSGLIVRRT ETTDAPDPPG GTGTVRRFTP
AAGRSVPPGP SGPVSPPGEP GPPGSPGAAD LAVTSGPVAV PVAPGGPGPG GSADSGGASG
PPGRNLFEGT PYGARRRTAG WGGRRVTAAV AAGVAAVALA VTGVVVLVGG GAGSDDQVAV
PVTPTAPPRT PAPATIDSIA AWAAAPSGGF YVVEDGGTRL LRLGLDGKVS VVAGTGAQGS
DGDGGPAVQA RLRGLDAVAV DGAGTVYLGE DSNGKIRRIG RDGVITTVVG GGTRLAREGE
PATEVSLSLI SGPIAVDTDG DLYLYGAGRI YRVDRDGILH VVARTQEPGS TEAPPEGEED
VPVVAANIGD LEVRDGQVYI ADYSADRIQV LGPDGAVRTL AGGGREGDSG DGGPATAAAL
SLSVEASVLA FDPAGGLYVV ESLGNRVRRV DPGGTITTVA GTGELGSDGD GGPAAKARLW
SPRRITVDAA GVLYVNETGT TIRRVGLDGI ITTIHE