Gene Franean1_2372 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2372 
Symbol 
ID5670768 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2816874 
End bp2818790 
Gene Length1917 bp 
Protein Length638 aa 
Translation table11 
GC content76% 
IMG OID641241289 
Productserine/threonine protein kinase 
Protein accessionYP_001506710 
Protein GI158314202 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.354832 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0671024 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCTGACGC CCCTGACCAC GGACGACCCG CAGCGGATCG GTCCGTACCG GCTGGCCAAC 
CGCATCGGCG CGGGCGGCAT GGGCATCGTC TACCTCGGCT TCTCCGACGA CGGCAAGCCG
GCCGCGATCA AGGTTCCCTC GGCGGGGCTG GTGGACGACC CCGAGTTCCG CGCCCGCTTC
CGCCAGGAGG TGGACGCCGC CCGCCGGGTG CGCGGCAACG CCGTAGCCGC CGTCATCGAC
GCCGACCTCA CCGGGACGCG CCCGTGGATG GCCACCGAGT ACGTCGAGGG CCGCAACCTC
ACCGACGCGG TCGCGACCCG GGGCCCGTTC GACGAGCGCC TGCTCACCGG GCTCGCCGTC
GGTCTGGCCG ACGCCCTGGT CGCCATCCAC GCCGCCGGGG TGGTGCACCG CGATCTCAAA
CCGTCCAACA TCCTGCTGGC CTGGGACGGG CCCCGCGTCA TCGACTTCGG CATCGCCCGG
GCCGAGAACA ACACCTCGCA CACCCGGGCC GGCAGCCTCA TCGGCACCCT GACCTGGATG
GCGCCCGAGC AGCTGCGCGG CGAGCGGGCC GGGCCGGCCG CCGACGTCTT CGCATGGGGG
GCGTGCGTCG CGTTCGCGGC GGCCGGGCAG CCGGCGTTCC GCGGCGACCG GGCCGAGGCC
GTCGGCCTGC AGATCCTCAC CGGCGAGCCG GTGCTGGAAC GCCTGCCACC GACCATCGAG
CCGCACGTGC GCGCCGCGCT GCGCAAGGAG CCGGCCGCGC GGCCCAGCGC CGCCGAGATC
CTCGGCGGGC TGCTCGGTCG GCCGGTCAGC GGCCCGGCCG ACTCGGACGC GGCGACCGGG
CTCCTGATGA GCCGGTGGTG GAACCTGCCG CCGACCCCGC CCGAGGGCGC CACGCCGCTG
CGCGGGCACC CACCGGCGGG CAGTCATCCC CGGGACGCCG ATCCACACGG TGCCGGTCCA
CGGGGCGGCC CACCTCCCGG CGGCTCCTAC CGGGGCGCCC GTCCCCACGA CGGGTACCCG
GGCGGAGCGC AGCTCCAGCC CGGCCACCGG CCGAACCCCG GCCCGCCGGA CGCCGGCTAC
CGCGGTCCGG ACTCCGGCCC GCGCGGCTGG GCCGACTCCG GACCGCACGG CGGCTGGGCC
GACTCCGGCC AGTCGGGCGG TGGCTGGCCC GGCTTCGGTG GGCCGGGCGC GGGACCGCCC
GGAGCGGGAC GTCCGGACGG CGGCGGACGG CCCGATCACG GCCGTCGCGG GATGCCCGTG
GCCGTGCTCG CCGCCCTGGC GGTGCTGCTC GTCGTCGGTG GCGTCACCGC GGGAGCCCTG
CTGCTGTCGG GCAGCGATGG AGACGGCGGC CAGGCCGGCC CGTCCACGGG GCCGGACGTG
ACCTCCACCC TGAGCGGCCC GACCACCGCC CCCAACGGCG GGCCCACCAC CTCGGCCGGG
CCGACCGGGC CGGTCACGAA CCCGACGAGC CCCGGCTCGG GCGCCACCAC GGCGCCGCCG
ACCACGCCGG GAAGCGGGCC GACGTCCACC TCGACGCCAC GGCGGGTGAT GACCGCCGAC
GAGGCGGCCG GGGTTGTCCG CGAGCATGGC TACACCCCCG AGATGGGCTC CTACGACCCG
GACCGGATGC TCAACCTCGT CCGCGGCACC AGGCAGGGGG ATGATGGCCG GCAGCGCCAG
ACGGCGTTCG TCTTCGCCGA CGGCGAGTAC CAGGGCACCG ACACCAAAGC GCCGAGCAAC
GCCATCACGG TGGAGGTCAG GACGAACACC GACGCGACGG TGACCTACCA GACCTACGTC
GCGAACGGAA CGACACCCAC GGGGACGACG TCGGTGCGTT TCCGCTGGAA CGGCACGGAT
TTCGTGGCCC TCGACCCCAT CCCGTCCGAT GATCCGACGG TGGACAACCA CCGCTGA
 
Protein sequence
MLTPLTTDDP QRIGPYRLAN RIGAGGMGIV YLGFSDDGKP AAIKVPSAGL VDDPEFRARF 
RQEVDAARRV RGNAVAAVID ADLTGTRPWM ATEYVEGRNL TDAVATRGPF DERLLTGLAV
GLADALVAIH AAGVVHRDLK PSNILLAWDG PRVIDFGIAR AENNTSHTRA GSLIGTLTWM
APEQLRGERA GPAADVFAWG ACVAFAAAGQ PAFRGDRAEA VGLQILTGEP VLERLPPTIE
PHVRAALRKE PAARPSAAEI LGGLLGRPVS GPADSDAATG LLMSRWWNLP PTPPEGATPL
RGHPPAGSHP RDADPHGAGP RGGPPPGGSY RGARPHDGYP GGAQLQPGHR PNPGPPDAGY
RGPDSGPRGW ADSGPHGGWA DSGQSGGGWP GFGGPGAGPP GAGRPDGGGR PDHGRRGMPV
AVLAALAVLL VVGGVTAGAL LLSGSDGDGG QAGPSTGPDV TSTLSGPTTA PNGGPTTSAG
PTGPVTNPTS PGSGATTAPP TTPGSGPTST STPRRVMTAD EAAGVVREHG YTPEMGSYDP
DRMLNLVRGT RQGDDGRQRQ TAFVFADGEY QGTDTKAPSN AITVEVRTNT DATVTYQTYV
ANGTTPTGTT SVRFRWNGTD FVALDPIPSD DPTVDNHR