Gene Franean1_1869 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1869 
Symbol 
ID5670271 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2245649 
End bp2247799 
Gene Length2151 bp 
Protein Length716 aa 
Translation table11 
GC content68% 
IMG OID641240791 
ProductWD-40 repeat-containing serine/threonin protein kinase 
Protein accessionYP_001506213 
Protein GI158313705 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase
[COG2319] FOG: WD40 repeat 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0650147 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGATTGTTG ATCGTGCGCA TGTGGCTGCG GCACTACCGG GCTATGAGCT GGGTGATCAG 
ATCGGCGCCG GCGCGTTCGG GCTGGTGCTC GCCGGCTGGC ACCGTCGCCT GGGGCGCGAT
GTCGCGATCA AGGTGCTGGC CGCGCGGGAC CGTTCCGGGC TAACCGCCAG TTTCGCCGCC
GAGGCACAGA TCCTTGCCAG CCTTGATCAC CCGCACATCG TCCGGGTGTA CGACTACGTC
GAGACCGACG ACCTGCGCCT GATCGTGATG GAGATGCTGG CTGGGGGGAC CCTGACCCGC
CACCAGGCGA GCATCAGCCA GCCGGGGGCC TGTGCGGTCG GCCTGGCCGC GGCCGGAGCC
TTGTCGTGTG CGCATTCCAG GGGTGTGCTG CACCGCGACA TCAAGCCCGA CAACATGCTG
TTCGACGCCG CTGGACTAAT CAAGGTAGCC GATTTCGGAA TCGCCAAGCT GGTAAGCGGG
TCGGCGGCCA CCGCGAGCTC AGTGGTCGGA ACACCGCTAT TCATGGCCCC CGAGCAGATC
GCGGGGGGCC GGCTGGGGCC GACCACAGAT CTGTACGCAC TCGGAGTGGT GCTGTATCTG
CTGTTGGCCG GTACAAGGTC GTTCGGCCCA GCGGCGGCTA CCCAGCCTCC GTGGCCAAAC
CCCATCGACC AGCACATCCT GCCGCCGCCG ATCGGAGTAC CCGAGCCGGT GGCAACGGTG
GTTCTACGCA TGCTTTCCCC GGTCCCAGCA GACCGGCCGC CATCCGCGCA CGCTTTTGCC
GTCGATCTCG CCCAGGCCGC CGCCACCGCA TACGGCCCGG GGTGGATCGC CACGGCTGGG
GTGTCGCTTC GGCTCGATGA CGATGTGCGC GCCGCCGCTG AACGCCCGGC GAGACTCACC
CGACTGCCGC CATCATCCGC CGCCGACGAC ATGTCGCCTT CGCATCCCGC GGACAGCAGT
CCGGAGGATG GGCCCGCCAG CGAGAGCGTG GCCAGCCGTG GGCGCCACGC ACGGTCGCAT
CCCCGGCTGG CCTCGTCGCC GCGCCACCGG CTCGCCGCCG GCGTGGTGCT CCTCCTACTC
GCCGTTGCCG GCACCGTCCT TATCCCTGTC GCTGCGCGCA GTGGGAATCC GAACGAGCCC
AGTTCTCACC CCGTCGCACT GGGCGCCTCC GCCAAGGAGA CGCCGGCTCC CGCCGCCCAG
GCGCTGGGCC AGCCCTTGAC GGATCACACC GACTGGGTGG CCTCGGTGGC GTTTTCCCCG
GATGGCCACA CCTTGGCCAG CAGCAGCAAG GACACGACAG TCCGGCTCTG GGACATCACC
GACCGCACCC GTCCCCACCC CCTCGGCCAA CCCCTCGCCG GCCACACCCT CGGAGTAATG
TCGGTGGCTT TTTCCCCGGA CGGCAACACT CTGGCCAGCA GCAGCAGGGA CACGACAATC
CGGCTCTGGG ACATCACCGA CCGCACCCGT CCCCACCCCC TCGGCCAACC CCTTACCGGC
CACACGGACG CCGTCACCTC GGTGGCGTTC TTCTCGGACG GCCGCACCCT GGCCAGCAGC
AGCAGGGACA CCACAATCCG ACTCTGGGAC ATCACCGACC GCACCCGCAC CCGCCCCCTC
GGCTCGCCCC TGTCGGGCCA CAGCGACTGG GTCACCTCGC TAGCGTTAAC CATGGACGGC
CGCACCCTGG CCAGCAGCAG CCTCGACAGC ACAGTGCGGT TGTGGAACAT GGCCGACCGA
TCTCATCCCC AACCCATCGG CCTACCCCTT ACCGGACATA CCGGCGGAGT GAACTCAGTA
GCGTTCTCCT TGGACAGCCG GACCCTGGCC AGCAGCGGCA GGGACACCAC AATCCGACTC
TGGGACGTCA CCGACCGGTC TACTCCCCGT CTGCTCGGGG CACCGATCAC CGGCCATGCC
AACACCGTCG GACCGCTAAC GTTCTCTCAG GACGGCGACA CCCTGGTCAG CGGAAGCTAC
GACGACACCG TACGAATATG GGACGTGACC GACCGGTCCC ATCCCCGCCT ACTTGGTCTG
CCCCTGACCG GTCACACTGA CTGGATTTGG TCGGTAGCGT TGTCGCCGGA CGGCCAAACC
CTTGCCAGCG GCAGCAAGGA CAACACAATA CGGCTGTGGG CCCTTCCCTG A
 
Protein sequence
MIVDRAHVAA ALPGYELGDQ IGAGAFGLVL AGWHRRLGRD VAIKVLAARD RSGLTASFAA 
EAQILASLDH PHIVRVYDYV ETDDLRLIVM EMLAGGTLTR HQASISQPGA CAVGLAAAGA
LSCAHSRGVL HRDIKPDNML FDAAGLIKVA DFGIAKLVSG SAATASSVVG TPLFMAPEQI
AGGRLGPTTD LYALGVVLYL LLAGTRSFGP AAATQPPWPN PIDQHILPPP IGVPEPVATV
VLRMLSPVPA DRPPSAHAFA VDLAQAAATA YGPGWIATAG VSLRLDDDVR AAAERPARLT
RLPPSSAADD MSPSHPADSS PEDGPASESV ASRGRHARSH PRLASSPRHR LAAGVVLLLL
AVAGTVLIPV AARSGNPNEP SSHPVALGAS AKETPAPAAQ ALGQPLTDHT DWVASVAFSP
DGHTLASSSK DTTVRLWDIT DRTRPHPLGQ PLAGHTLGVM SVAFSPDGNT LASSSRDTTI
RLWDITDRTR PHPLGQPLTG HTDAVTSVAF FSDGRTLASS SRDTTIRLWD ITDRTRTRPL
GSPLSGHSDW VTSLALTMDG RTLASSSLDS TVRLWNMADR SHPQPIGLPL TGHTGGVNSV
AFSLDSRTLA SSGRDTTIRL WDVTDRSTPR LLGAPITGHA NTVGPLTFSQ DGDTLVSGSY
DDTVRIWDVT DRSHPRLLGL PLTGHTDWIW SVALSPDGQT LASGSKDNTI RLWALP