Gene Franean1_5221 details

Gene Information

Locus tag: Franean1_5221
Symbol:
ID: 5673555
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 6268037
End bp: 6270187
Gene Length: 2151 bp
Protein Length: 716 aa
Translation table: 11
GC content: 78%
IMG OID: 641244075
Product: serine/threonine protein kinase
Protein accession: YP_001509485
Protein GI: 158316977
COG category: [K] Transcription
    [L] Replication, recombination and repair
    [R] General function prediction only
    [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
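
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the Start bp and End bp values are 1-based and inclusive, and that the gene length includes one terminal stop codon. Neither assumption is stated in the record itself, and the variable names are made up for this example.

```python
# Values copied from the Gene Information fields above.
start_bp = 6268037
end_bp = 6270187
gene_length_bp = 2151
protein_length_aa = 716

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A CDS should be a whole number of codons.
codons = gene_length_bp // 3

# Assumption: the last codon is a stop codon and is not counted in the protein.
coding_codons = codons - 1

print(span, codons, coding_codons)  # 2151 717 716
```

Under these assumptions the span matches the reported gene length, and the codon count minus the stop codon matches the reported protein length.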


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0344538
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 20
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGCGAGCAC GGAAGAGTAC GACGACCCCG TCAACCGGCC CGGTCGGCGC GTTCGGGCGC 
GATCTTCGCG CGGCGCGGTG CGGGGCGGGA AATCCCTCGT ACCGGACCCT GGCGCGACGC
AGCGGAATGC CGGCGGAGAT CCTGGCCGGC GCCGCCAGCG GCGGGTCGCT GCCGGCGCTG
GACGTCCTGT GTGCCTTCGT GTCCGCCTGC GGTGGGGACG TCGGCTCCTG GGTGGCGCGG
TGGCACGAGC TCGCGGCGAT CCTGGAGGCG GAGCGCGAGA CCCGGATAGC CCCGCCCGGG
GCGGACGTCC CGTCGCAGCC GCCGCCGGTC GGGCCGTGGC CGCCGCGCAC CAGCGGCGCG
CCGCCCGGAT CACGCCCACC CCACGCGTCC CCCGGCAGGG TTCCCTCCCA GCGTTCCAGC
GGCGGGCCGG CCGACGACTT CCTCGCCCCG CTCAGCGCCG ACGACCCTCG GGAGGTCGGG
CCGTTCCGGC TGCGCGGCCG GCTGGGCTCC GGCGGCATGG GCGCGGTGTA TCTGGGCCAC
TCCCCGGGGC AGCGGCCGGT CGCGGTGAAG GTCATCCGCG CCGACATGGC GTCCGACAGC
GAGTTCCGGC GTCGTTTCGA GCGGGAGGTC GCCGCGATGG GCCGGGTGAA CAGCCTGTTC
ACCGCGCCGC TGATCGCCGC GGACGTCGCC GCGGACCGCC CGTGGCTCGC CACCGCCTAC
ATCCACGGCC CGACGCTGCG CGACAGCGTG CTGCGCAACG GCCCGCTGCC GCCGTCCAGC
CTGCTCCGGC TGGCCGCCGG GGTGACGGAG GCGCTCGTCG CCATCCACGG CGCGGGGGTC
GTGCACCGTG ACCTGAAGCC GGCGAACGTG CTGCTGGCGA TCGACGGGCC CCGCGTGATC
GACTTCGGGA TCGCGCGGGC AGCCGACCAC GCGGGCAGCA CCACCACCGG GAAGGTCATC
GGATCGCCGC CGTACATGTC GCCCGAGCAG GCCCGCGGCG AACGGGTGGA CGCGGCCTCC
GACGTGTTCG CCCTCGGGTC GGTGCTCGCC TTCGCCGCGA CCGGGCGCAA CGCCTTCGGC
GAGGGCAACA CCGCCGATGT GATCTATCGC GTGGTCCGGG GGGAGCCGGA GCTGACCGGC
GTGGACGGCG ACCTGCGCGC GCTGATCGAG TCGTGCCTGG CCAAGGCGCC GCAGCGGCGG
CCGACGCCGG CCGAGATCCT GGGCCGCTGT CACGCCCAGC TCGGGGCCAG CCCCCGCCCG
CCCAGCTGGC TGCCCGTGCC GGTGATCGCC GAGATCAGCC AGCGGCTGCG GCATCCGGCC
GTGGTGGACC GTCCCACGGA GCCGGCCCGC CGCCCGGTGC GGGGGCTGGT GGTCGCGGCG
TCCGTGCTGG CCACCGCCAC GGTCGTCGCG ATGACGCCGG CGCGCAGCGT CCTCACCCCG
TGGGAGCTGC TCCCGAGCTG GGGCGACGGC GCGCAGGCCC CGCGGGCGCC GTCGAGCCCG
CCGCCCACGC CGACCGCCGG GGGCGTGGCG GAACGGCGGT CGGCCGAGCG GCAGTCGGCG
GGCAGCCGGC GCACCACCGA CGGCCGCGGC GCCGGGACAT CCGCCGGCAC GCCGGGTGGC
ACGTCGGACG GCCTGGCGGA CGGCGCGGCG GGCGGCACGT CCGCGGGGAT GCCCGGCGAA
GCGACCGGAG GCACCATCGG CGGCGTCGCG GGCGGACTGG AGCAGGGCCT GCCGGGCACG
GGCCAGGATC CGGGAACGGG CCGGGGACCG GCCGGCGGCC CGGTTCAGGC GCCGGCCGCC
CCGCAGTCCG GCGGGAGCGG CGGCCAACCG GCCACACCCG GGACACAACG CCCACCGGCC
GTGCCGCAGG CGCCGGGTCC GGGTCCGGGT CCGTCCTGGC CGTTCCCGTC GCAGGGCTCG
GGCGCGACGA ACCCCCCACC AGCGCAGCCG CACCCGACGC CGGGATCACC CATGGCCACC
GACCCGCCCC CGCCAGCAAC GCCGGAGGAC CTGCCCAGGA CCACCGCACC CCCGCCGGAG
ACGCCCCCGC GGACCACTCC GCCACCGGAA CCGGCACCGC CCGAGACCCC GGCGGCGCAC
CCCGGCTCAC CGCCGGCGGA GACCGCCCGC ACGGAGGCTC AGGACCTCTG A
 
Protein sequence
MRARKSTTTP STGPVGAFGR DLRAARCGAG NPSYRTLARR SGMPAEILAG AASGGSLPAL 
DVLCAFVSAC GGDVGSWVAR WHELAAILEA ERETRIAPPG ADVPSQPPPV GPWPPRTSGA
PPGSRPPHAS PGRVPSQRSS GGPADDFLAP LSADDPREVG PFRLRGRLGS GGMGAVYLGH
SPGQRPVAVK VIRADMASDS EFRRRFEREV AAMGRVNSLF TAPLIAADVA ADRPWLATAY
IHGPTLRDSV LRNGPLPPSS LLRLAAGVTE ALVAIHGAGV VHRDLKPANV LLAIDGPRVI
DFGIARAADH AGSTTTGKVI GSPPYMSPEQ ARGERVDAAS DVFALGSVLA FAATGRNAFG
EGNTADVIYR VVRGEPELTG VDGDLRALIE SCLAKAPQRR PTPAEILGRC HAQLGASPRP
PSWLPVPVIA EISQRLRHPA VVDRPTEPAR RPVRGLVVAA SVLATATVVA MTPARSVLTP
WELLPSWGDG AQAPRAPSSP PPTPTAGGVA ERRSAERQSA GSRRTTDGRG AGTSAGTPGG
TSDGLADGAA GGTSAGMPGE ATGGTIGGVA GGLEQGLPGT GQDPGTGRGP AGGPVQAPAA
PQSGGSGGQP ATPGTQRPPA VPQAPGPGPG PSWPFPSQGS GATNPPPAQP HPTPGSPMAT
DPPPPATPED LPRTTAPPPE TPPRTTPPPE PAPPETPAAH PGSPPAETAR TEAQDL
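
The relationship between the gene sequence and the protein sequence above can be sketched in a few lines of Python. This is a minimal illustration, not the pipeline that produced the record. The function names `clean`, `translate` and `gc_fraction` are hypothetical. The codon table uses the standard amino acid assignments, which translation table 11 shares with the standard code, and the start-codon set is limited to ATG, GTG and TTG as an assumption (table 11 permits further alternatives). A start codon in first position is reported as M, which is why the gene begins with GTG while the protein begins with M.

```python
from itertools import product

# Standard amino acid assignments in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only the three most common bacterial start codons are handled here.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used in the record's 10-base blocks."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    dna = clean(dna)
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        aa = CODON_TABLE[codon]
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an initiating codon is read as methionine
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First line of the gene sequence above (60 bases, 20 codons).
first_line = "GTGCGAGCAC GGAAGAGTAC GACGACCCCG TCAACCGGCC CGGTCGGCGC GTTCGGGCGC"
print(translate(first_line))  # MRARKSTTTPSTGPVGAFGR
```

Passing the full gene sequence to `translate` and `gc_fraction` in the same way gives values to compare against the Protein sequence and GC content fields.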