Gene Franean1_4944 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_4944 
Symbol 
ID5673283 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp5934691 
End bp5936982 
Gene Length2292 bp 
Protein Length763 aa 
Translation table11 
GC content76% 
IMG OID641243798 
Productserine/threonine protein kinase 
Protein accessionYP_001509214 
Protein GI158316706 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.208032 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.80492 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACGGTCG ACGGCATCGC CTGCCCGGAA CCCGACTGCG ACGGAATCAT CGAGGACGGG 
TACTGCAACG TGACCGGGCT GGCGTACCGG CCACCCGAGC CTGACCCGGG GCCACCGCCC
CACCCCGACC CGACCGGGCC TGGCACGGGC GGGTCGGGCA CGAGCGGGTC CGGCACAGGC
GGGTCGGGCG GGAGCGGGTC GGCTGGCGGG TCCGCGTCGT GGAGTTTGAC GGCGGGCGGG
ACACCGCGCC GGCGTCGACG CCCCGGCCGG GCCCGTCGTC CCCAGAGCCG GCTCGGGGAG
GGGCTCGTGG ACGTCCCGGA CATGCCGACC CCCGATCCGG AGTCGCTGCT GCTCACCGAC
CCGTGTGTTC CGGAGCACCG GCGGGTGTGC TCGGCCTGCG GGGCCGAGGT CGGGCGGGCC
CGCGACGGCC GGCCGGCTGC CGTCGAGGGC TTCTGTGTCG TGTGCGGGCA CCAGTTCTCG
TTCGTCCCGG CGCTGCGCGC CGGTGACCGG GTGGGCTCCT ACGAGATCGC GGGCGCGCTG
GCGCACGGCG GGCAGGGCTG GGTCTACCTC GCCCGCAACC GTGAGGTCGC GGACAACTTC
TGGGTCGTGC TCAAGGGCCT GCTCGACAGG GGTGACCCGG ACGCCCAGGC CGCCGCGATC
GCCGAGCGCC GCTTCCTCGC CGCGGTGGAC GACCCGGCGA TCGTGCGCAT CCACACGTTC
GTCAGGCACG CGGGTACCGG CTACATCGTC ATGGAGTACG TCGGCGGGAC GTCGCTGCGC
GAGGTCCTGC GGCAGCGCCG AGCGGAGGGG GGGCGTCCCG ACCCACTGCC GGTCACCCAG
GCGATCGCCT ACATCCTGGC CGTGCTGCCG GCGTTCGCCT ACCTGCACCG CAACGGCCTG
GTGTTCTGCG ACTTCAAGCC GGACAACGTC ATGCTCGGCC GCGAGAGCCT GCGGCTGATC
GACCTGGGGG CGGTCCGCCG GCTCGACGAC GAGAGCGGCG CGTCCTACCG GACCCGCGGC
TACTCGGCCC CGGAGGTCGA GACCGAGACG CCCACCGTCG CCTCCGACCT GTACACGGTC
GGGCGGACGC TGGCCACCCT GATCCTGAAC TTCCGTGGGA ACACGACCAC CTACGTCCAC
AGTCTCCCGC CGGCGTCGAC GCACGAGGTC CTGGCGCGCC ACGAGTCGCT CGACCTGTTC
CTGCGCCGGG CCACGGCCTG GCTGCCCGAG GACAGGTTCG TCTCCGCCGA CGAGATGCGC
GACGAGCTGC TCGGCGTGCT GCGCGAGATC CTCGCCGCGG AACGGGGCGC ACCCGTGCCC
GCGCCGAGCC GGCGGTTCAC CGGGGACGTC CACCTGACCG GGGAGGACGC CGACGGGTCG
GTCGTCAGGC CGCGGTGGGC GAGCCTGCCG CGGCTGCGCG TCGACCCGGA GGATCCGGCC
GCGGGCACCC TCGCGGCGCT GCCCGACAGT TCGCCCGCGC AGCTCGCCAC GCTGCTGGCG
GCCATCAGCC CGGCGACGGT CGAGGTCCGG CTCCGGCTGG CCCGCGCCCA CCTCGAGACC
GGGGACACCG CCGCCGCCGC GGCCGTGCTC GACGAGGTCG AGGCCGAGGA TCCGTTCGAG
TGGCGGGTCC GCTGGTACCG GGGGCTGCTC GCGCTCGGCG ACGGCGACGG CGACGGCGAC
ACCGCCGCGG CGGCCGCGGT GTTCACCGAG GTGTACGCGC AGGTGCCCGG GGAGCTCGCG
CCCAAGCTGG CGCTGGCCGT CACGGCCGAG GCGGCCGGCG ACCAGGCCCG GGCCGCCACG
CTGTTCGACC TGGTCTCGCG GACGGACGAC GGATTCACCA GCGCCGCGTT CGGCCTCGCC
CGGGTGCGGG TCGCGGCGAA GGACCGCGCC GGTGCCGTCG CCGCCTACGA GCGGGTTCCG
CCCTCGTCGG CGGCGTATCA GGAGGCCCGG ATCCGGACCG CGCTCGTCCG CGGGACCAGG
ACGGCGGCCG GTGTGCCGCG GCCCGCCGAC CTGGTCGCCG CGTCGGGCAT CCTCGCTGGC
CTGGACGTCG ACCGGCGGCG CCGGGTGGCA CTCACCCGGG ACCTGCTGCG CTGCGCGCTG
GACCTGCTCC TCGCGGGAGA CACGCCACCC GACCCGGACG TCGAGGTCGC CGGCACCCGC
CTACGCGAGG ACGACCTGCG CTTCGGGCTG GAGCGCGCCT ACCGGGAGCT TGCCACCCTG
GCGCGGAGCG CGCAGGAGCG CTACGACCTG GTGGATCTCG CGAACGCCGT GCGCCCGAGG
ACGTGGCGGT GA
 
Protein sequence
MTVDGIACPE PDCDGIIEDG YCNVTGLAYR PPEPDPGPPP HPDPTGPGTG GSGTSGSGTG 
GSGGSGSAGG SASWSLTAGG TPRRRRRPGR ARRPQSRLGE GLVDVPDMPT PDPESLLLTD
PCVPEHRRVC SACGAEVGRA RDGRPAAVEG FCVVCGHQFS FVPALRAGDR VGSYEIAGAL
AHGGQGWVYL ARNREVADNF WVVLKGLLDR GDPDAQAAAI AERRFLAAVD DPAIVRIHTF
VRHAGTGYIV MEYVGGTSLR EVLRQRRAEG GRPDPLPVTQ AIAYILAVLP AFAYLHRNGL
VFCDFKPDNV MLGRESLRLI DLGAVRRLDD ESGASYRTRG YSAPEVETET PTVASDLYTV
GRTLATLILN FRGNTTTYVH SLPPASTHEV LARHESLDLF LRRATAWLPE DRFVSADEMR
DELLGVLREI LAAERGAPVP APSRRFTGDV HLTGEDADGS VVRPRWASLP RLRVDPEDPA
AGTLAALPDS SPAQLATLLA AISPATVEVR LRLARAHLET GDTAAAAAVL DEVEAEDPFE
WRVRWYRGLL ALGDGDGDGD TAAAAAVFTE VYAQVPGELA PKLALAVTAE AAGDQARAAT
LFDLVSRTDD GFTSAAFGLA RVRVAAKDRA GAVAAYERVP PSSAAYQEAR IRTALVRGTR
TAAGVPRPAD LVAASGILAG LDVDRRRRVA LTRDLLRCAL DLLLAGDTPP DPDVEVAGTR
LREDDLRFGL ERAYRELATL ARSAQERYDL VDLANAVRPR TWR