Gene Franean1_1777 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1777 
Symbol 
ID5670179 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2133971 
End bp2135833 
Gene Length1863 bp 
Protein Length620 aa 
Translation table11 
GC content76% 
IMG OID641240698 
Productserine/threonine protein kinase 
Protein accessionYP_001506121 
Protein GI158313613 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00817012 
Plasmid hitchhikingNo 
Plasmid clonabilitydecreased coverage 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.221734 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACCTTCA CCGACGATTC CGGCGATCGG CTGGGGCGGC ACGGCGGCGA GCCCGTCGTG 
CCGGGAATCC TCCTGCACGG CCGCCGATAC GACGCCGGCG CCGGCGAGGT CTGGGCCGGC
CGGATCGAGC GGCTCGGGAT GGACGTGACC ATCCGCTACG TTCGGTTACC GGCCGACTCG
CTGCTGCGCG CCGAGGCCTT CGACGAGGCC CAGCTGCTGC TCGGCATCCG CCATCCGCAC
CTGGCCTCCG TGGTGGACGT GATCCGCACA CCCGACGGCC TGGCCGTGGT CACCGAGCCG
GTGCACGAGG CGGTCAGCCT GTCCCGGCTG CTCGGCGCGC GGGGTCGGCT CGACCCGGGC
GAGGTCGTCA CCATCGGCCT ACCCGCGGCG CAGGCGCTGG CCGAGGTGCA CGCGGCCGGT
CTGCACCACG GCGGGCTGAC CGCCCTGGAC ATTCTCCTCG AGCCGAACGG GCGGCCGGTT
CTCACCGGCG CCGGCATCGC GGCCCTCACC GACCCGGGCG TGTCTGCCTC GGAGGACGTG
CACGACCTCG CCGACCTCCT GCTCGGCGCG ATGCGCCAGG CGACGGGGCC CGATGCCGCC
GCGGTCGCCG TCGCGGTCGC GATGGCCCTG GTGGACGACC CGCGCCGCCG GCCCTCGGCC
GCCGAGCTCG CGGCCGGACT GGCCCGCAGC GCCACGCCAC TTCCCGTCCG GATGACGGAC
GTGCCATCCG GGGCAGCAGG GGTTGTCGAG GTCGACCCGG AGCCCGGCCG GCCCGCGGAC
GGCGGGAACG CCGCCCCGGT CGACCAGGGC GACCTGCTGG ATCCCGACGA GAGTCTCCGG
GGCGAGGCCG GCGACAGCGG CCACCCGGAC CCGTGGTCGG CAGCCGAGGA CGTGGCGGGG
CCGGGCGACC CGGCCACGGG CGACCGGGGC TTCGAGGGCG ACCCGGGCTC CGCGGCCGGC
CCACCGGAAA CCCCACGGCC CGCCGGTCGG GATGATCCCT ACGGTCTGTA CCTCGCGACC
GGTCAGGTGA TCCCGGACGA GGACGACGAG CCACCACCGC CCAGGGAGGC CGACGCCGAG
GGCATCCGGC CTCCGCATGA CGGGCGGACC GCACCGGCCG AACCAGCCGA CCTGGTGGGC
TCGCTGCGAC CGGTGAACCA CGACCGGGCC GGCACCCGCC GCCGCGCCGG CCGGGCCGGC
CGGGCCGCGG CGCCCGCCCG CGCCCGGGGC ACCACCCGCT CCCCGGCACC GCGCGCGGCC
AGGGCACGCC GGGCAGCGGG GGAGCGGGCC GGCGGCCGGC GGCGGCACCC CGTGTTGTTC
CCGGTGCTCG CGGGCGTCGG TCTGGTGATC GTCGCCATCG CCGCGCTCCT GCTGGCCCAT
GACTCCGGGT CGGACGCCAC CGGTCAGCCC GTGCCCACCG GGAACACCGC CCGCGCGGAC
CCGACCGCGA GCCCCGTCAA CGGGCAGACC CCGGAACAGG TGTGGCGAGC CGTGCTCGAC
GAGCTCAACA CCGCCCGCAG CAAGGCCTTC GAGCGGGCGG ACGAGACCGC CCTCGACCTC
GCGGACGCGC CCGGCAGCGC CGTCCTGGAG TCGGACCGGA CATCGATCCG GGAAATGGTC
CGGCGAAGCG GGCGCTCCAC GCCGGTGCGG ATCGAGATCG TCGACGTCGT GGTTCGTGAG
GAGGAAGCCG ACCGGGTGGT CCTACGGGTC ACCGAGATCG TGGGGGCCTA CGACTTTGTC
GGCGAGGCGG GCAATGTGCT GGCCCAACAG CCGGCCAGCC CGCCGGAGAC GAAGGATCTC
ACCCTGTGGC GGACCGAGGC GGGTTGGCGG CAGGCCGAGA GCGTCAAGAC GGCCCCCAGC
TGA
 
Protein sequence
MTFTDDSGDR LGRHGGEPVV PGILLHGRRY DAGAGEVWAG RIERLGMDVT IRYVRLPADS 
LLRAEAFDEA QLLLGIRHPH LASVVDVIRT PDGLAVVTEP VHEAVSLSRL LGARGRLDPG
EVVTIGLPAA QALAEVHAAG LHHGGLTALD ILLEPNGRPV LTGAGIAALT DPGVSASEDV
HDLADLLLGA MRQATGPDAA AVAVAVAMAL VDDPRRRPSA AELAAGLARS ATPLPVRMTD
VPSGAAGVVE VDPEPGRPAD GGNAAPVDQG DLLDPDESLR GEAGDSGHPD PWSAAEDVAG
PGDPATGDRG FEGDPGSAAG PPETPRPAGR DDPYGLYLAT GQVIPDEDDE PPPPREADAE
GIRPPHDGRT APAEPADLVG SLRPVNHDRA GTRRRAGRAG RAAAPARARG TTRSPAPRAA
RARRAAGERA GGRRRHPVLF PVLAGVGLVI VAIAALLLAH DSGSDATGQP VPTGNTARAD
PTASPVNGQT PEQVWRAVLD ELNTARSKAF ERADETALDL ADAPGSAVLE SDRTSIREMV
RRSGRSTPVR IEIVDVVVRE EEADRVVLRV TEIVGAYDFV GEAGNVLAQQ PASPPETKDL
TLWRTEAGWR QAESVKTAPS