Gene Franean1_0144 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_0144 
Symbol 
ID5668569 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp173237 
End bp175087 
Gene Length1851 bp 
Protein Length616 aa 
Translation table11 
GC content75% 
IMG OID641239073 
Productserine/threonine protein kinase 
Protein accessionYP_001504517 
Protein GI158312009 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.161088 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0176124 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGATCACA ACGGGGAACG CGTCGCTCCG GCGGCGCGCG GGCAGCCGGG TGGCATGGCA 
CCACCCGGTG AGGTCCTGCA CCCGCACGAA CCCCGGGCGA TCGGCCCGTA CCGCCTGCTG
CGCCGCCTGG GCGCGGGCGG GATGGGCACG GTGTACCTCG CCGAGTCCGC CGGCCGCCGG
GTCGCGGTGA AGGTCGTCCG GCCGGACCTG GCCGCCGACG AGGAATTCCG GCGGCGGTTC
CGGCAGGAGG TCGACGCCGC CCGCCGGGTC GCGCCGTTCT GCACCGCCGA GGTGCTGGAC
GCCGACCCCG ACGCCGCCGC CCCCTACCTG GTGACCGAGT TCATCGACGG CGTCCGGCTC
GACCACGCGG TGGAGAACGG CCCGATCAGC GGCTCCACGC TGACCGGGCT CGCCGTCGGC
GTGGCCACCG CGCTGACCGC GATCCACGGC TCCGGTCTCG TCCACCGTGA TCTCAAGCCG
AGCAACGTGC TGCTGTCGAT GTCCGGCCCC CGGGTGATCG ACTTCGGCAT CGCGCAGGCG
CTCGAGGGCG CCAAGGCGAA ACCCACCGCG TGGGGCTTCG GGTCCGCCGG CTGGATGGCC
CCCGAGCAGG TGAACGGCCA GGCCATCGGC CCGGAGGCCG ACGTGTTCAC CTGGGGCATC
CTGGTCGCCT ACGCGGGCAC CGGCCACCAC CCGTTCGGCA CCGGGACGGA CCTGGAGCTC
AGCACCCGCA TCGTCGGCTC CCGCCCGGCC CTGGACGGCC TGCCGCACGA TCTGAAGGCG
CTGGTCGCGG CCGCCCTCGC CAAGAACCCC GACGACCGCC CGAACGCCCG CGACCTGCTG
CTGAGCCTGA TCTCGCTGCC GCCCACCCGA TCCGGCGACG GCCGGGACGA CGGCACCGGG
CGGCCGAGCG CCAGCGCCGT CGGCCAGGCC GAGGAACTGC TGAACCTCAC CAGGGGCACG
ACCCTGCCCG GCCTCGCGCC GACCCGGGTC GCCCCGGGGA CGTCCGGTGA GCGGGCCCCG
GCCAGCCACG GCGGCCCCCC AGGTCACGGT GGCGGGGCGG GGCCGTCCGG TGTCGCTGTT
CCGGCGCCGG CGGGCGCGCC GGCCTTCTCC GGGGCCCCCG GAGCGGCACG CGGGCCGGCT
CCCGGCGGCG GCCCCGCGGG ATTCGGGCCC TCCGGCTTCC GCCCCGCCGG TGCTGGCGCG
GCACCGCACC CACCCACGCA CGGGCCAAAC CCGCCGGTGC AGGGGCCGCG CCCGCCCGGT
CAGGGCCCGT ACCCGCCCGT CCCCTCTGGG ACCAGCCAGC CTCGCTCCAC GTGGCACCGC
GTGCTCCTGT TCGGCGCTGC CGCGCTCGCA GTCGTCCTGG CCGCCTGGCT GATCAGCTCG
GTCGCCGGCG GCTCGGACGA TCCGGGAGCA GTCTCGGGAG CCGGCGACCG GCCCGCGGCG
TCCCAGCAGC CGACGACTCC GCGCAGCGGG GGCCTGAACA CACCCGTCCG CGACGACCAG
CTCGAGTTCA CCGCGAGCAA GATCAGCTGT GGCGCCGACC ACGTCGGCGA CGGGTTCCTG
GCCCGCCGCC CGAGCGGCCA GTTCTGCCTG GTGGACATGA AGATCACCAA TGTGGGAACG
GCCGAGCGTG TTCTGAGCAA CTCCAACCAG TACCTGCACA CCTCGGACGG CGGCCGCCAT
GCCGCCGACT TCCTGGCCCG CTACTGGATC AACGACGGGA TCTGGGACAC GATCTCGCCC
GGCGCCACGG TCACGGGCAC CTTCGTCTTC GACATCCCCG TGGACGCGGA GGCCGAGGAG
CTCGAGCTGC ACGAGCGGCC GGAAAGCCCG GGCGTCACGA TCCAGCCGTA A
 
Protein sequence
MDHNGERVAP AARGQPGGMA PPGEVLHPHE PRAIGPYRLL RRLGAGGMGT VYLAESAGRR 
VAVKVVRPDL AADEEFRRRF RQEVDAARRV APFCTAEVLD ADPDAAAPYL VTEFIDGVRL
DHAVENGPIS GSTLTGLAVG VATALTAIHG SGLVHRDLKP SNVLLSMSGP RVIDFGIAQA
LEGAKAKPTA WGFGSAGWMA PEQVNGQAIG PEADVFTWGI LVAYAGTGHH PFGTGTDLEL
STRIVGSRPA LDGLPHDLKA LVAAALAKNP DDRPNARDLL LSLISLPPTR SGDGRDDGTG
RPSASAVGQA EELLNLTRGT TLPGLAPTRV APGTSGERAP ASHGGPPGHG GGAGPSGVAV
PAPAGAPAFS GAPGAARGPA PGGGPAGFGP SGFRPAGAGA APHPPTHGPN PPVQGPRPPG
QGPYPPVPSG TSQPRSTWHR VLLFGAAALA VVLAAWLISS VAGGSDDPGA VSGAGDRPAA
SQQPTTPRSG GLNTPVRDDQ LEFTASKISC GADHVGDGFL ARRPSGQFCL VDMKITNVGT
AERVLSNSNQ YLHTSDGGRH AADFLARYWI NDGIWDTISP GATVTGTFVF DIPVDAEAEE
LELHERPESP GVTIQP