Gene Franean1_3629 details


Gene Information

Locus tag: Franean1_3629
Symbol:
ID: 5671996
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 4299743
End bp: 4300858
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 68%
IMG OID: 641242513
Product: serine/threonine protein kinase
Protein accession: YP_001507933
Protein GI: 158315425
COG category: [K] Transcription
              [L] Replication, recombination and repair
              [R] General function prediction only
              [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
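
The length fields in this record can be cross-checked against the coordinates with a little arithmetic. The following is a minimal Python sketch, assuming the Start bp and End bp values are 1-based and inclusive and that the final codon of the CDS is an untranslated stop codon. The variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 4299743
end_bp = 4300858

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the last codon is a stop codon and adds no residue.
codons, remainder = divmod(gene_length, 3)
protein_length = codons - 1

print(gene_length, remainder, protein_length)  # prints: 1116 0 371
```

Under these assumptions the result agrees with the reported Gene Length (1116 bp) and Protein Length (371 aa).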


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 14
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTCCTGT TACGCGAGGG GCAGGTAGTC CGGGATACCT ACAAGGTCGA CCGGCTGCTG 
GGCGAGGGGG CTTTTGCCGA GGTCTACCGG GTCGAGCACC GCTATCTTGG CCGGCAGGCG
ATGAAGGTCT TCCGTCAGGT GGGAATGTCC GCGGAACAGG TGCGCGACGC ACTCGGTGAG
GCTATGTTGC TGTCCGGCAT GGGGCACCCC AACGTCATCC GTGTGTTCGA GGCGAACACC
GTCGAGACGG CCGGTGGGGT GTATGCGTAC TTCACCATGG AATACGTGGC CGGTGGAACC
CTGCACAGTT TCTGGTCGTC GTACGGCACC ACGTTCGTCC CCATCCCGAC CGTGGTGGAC
ATCCTGCGGC AGATAACGCG CGGCCTGGCG GTCGCGCATC GGGAGTCCCC GCCGATCGTC
CACCGCGACA TCACTCCGCA GAACATTCTG GTCGGCTACA GCGGGGCCGG TCTGCAGGTG
CGGATCAGCG ATTTTGGTCT CGCCAGGAAG GTCAGTGCTC TGACCCTGCT GGCGAGCTCC
CAGGGAACCA TCGCCTTCAT GGCGCCGGAG ACCCTGCTCC ACCCGCATCT CGCGTCGGTT
CCGGGGGATG TCTGGGCGCT GGGCGCGGTG CTCTACCTGC TGCTGACCGA CCGGCTGCCC
TACCCGCAGC GCTCAGGCGG CGACCCGATC ACCGCCGCCT GGAACACGGG CACGCTCGTA
CCGCCCAGCG AGATCCGGTA CTCCGTGGAC GAAGCACTGG ACGGAATCGT CGCCCGGGCG
CTGTCCTACG CCCCGGCGAA ACGCTATCCG TCGGCCGTCG AGATGCTCGC CGACCTGGAG
GTGTGGGAGC CAGGTAGGTC CGTGCCGCTG AGCCCGCCTG AACGGGAACC CGGGCCGGCC
GGGGAGATCG CCGCGGGCAA GAACGCGCTC GGCCGGTCGT CCCCCGCGGA CGAGGCCGCG
GCCGTGCGGA TGGCCCGCCA GGCGGTCACC GTCGCCACCG GCGGCCAGTA CGACCGGGCC
GCCGACCTGA TGGAGGAGGC CTGCAACAAG TGGCCCGGGC TGCGTGACCA GTACGCGTCA
CGGATCCGGC TGTGGCGCAA GGGGGTGACG ATGTGA
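
The GC content field can in principle be recomputed from the gene sequence. A minimal Python sketch follows, assuming GC content means the fraction of G and C bases over all bases. The helper name `gc_content` is hypothetical, and the exact rounding used by the source database is not known.

```python
def gc_content(seq: str) -> float:
    """Return the fraction of G/C bases, ignoring whitespace and case."""
    # The sequence is printed in space-separated blocks of 10 bases,
    # so strip all whitespace before counting.
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in bases) / len(bases)


# Example on the first line of the gene sequence only; paste the full
# sequence to compare against the GC content field of the record.
first_line = "ATGGTCCTGT TACGCGAGGG GCAGGTAGTC CGGGATACCT ACAAGGTCGA CCGGCTGCTG"
print(f"{gc_content(first_line):.0%}")
```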
 
Protein sequence
MVLLREGQVV RDTYKVDRLL GEGAFAEVYR VEHRYLGRQA MKVFRQVGMS AEQVRDALGE 
AMLLSGMGHP NVIRVFEANT VETAGGVYAY FTMEYVAGGT LHSFWSSYGT TFVPIPTVVD
ILRQITRGLA VAHRESPPIV HRDITPQNIL VGYSGAGLQV RISDFGLARK VSALTLLASS
QGTIAFMAPE TLLHPHLASV PGDVWALGAV LYLLLTDRLP YPQRSGGDPI TAAWNTGTLV
PPSEIRYSVD EALDGIVARA LSYAPAKRYP SAVEMLADLE VWEPGRSVPL SPPEREPGPA
GEIAAGKNAL GRSSPADEAA AVRMARQAVT VATGGQYDRA ADLMEEACNK WPGLRDQYAS
RIRLWRKGVT M
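
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below is a minimal Python example, assuming that translation table 11 uses the same codon-to-amino-acid assignments as the standard genetic code (to my knowledge the two differ only in the permitted start codons). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Standard codon assignments in the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon."""
    bases = "".join(dna.split()).upper()  # drop the 10-base block spacing
    protein = []
    for i in range(0, len(bases) - 2, 3):
        aa = CODON_TABLE[bases[i:i + 3]]
        if aa == "*":  # stop codon: end of the coding sequence
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases and last 36 bases of the gene sequence in this record.
print(translate("ATGGTCCTGT TACGCGAGGG GCAGGTAGTC"))          # prints: MVLLREGQVV
print(translate("CGGATCCGGC TGTGGCGCAA GGGGGTGACG ATGTGA"))   # prints: RIRLWRKGVTM
```

Both fragments match the corresponding start and end of the protein sequence listed in this record.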