Gene Franean1_2000 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_2000 
Symbol 
ID5670401 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp2405169 
End bp2406962 
Gene Length1794 bp 
Protein Length597 aa 
Translation table11 
GC content76% 
IMG OID641240921 
Productserine/threonine protein kinase 
Protein accessionYP_001506343 
Protein GI158313835 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0547084 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAACCCC TCCGCGCGGA TGATCCCCGG ACCACCGGCG GATACCGGCT GCTCGGCCTG 
CTCGGCGCCG GCGGGATGGG CCGGGTGTAC CTGGCCCGCG GCCCGGGCGG GCGCACCGTC
GCCGTCAAGG TGATCCGGCC GGAGTTCGCC GGCGATCCGA CGTTCCGCGC CCGGTTCCGC
CGCGAGGTCG AGGCCGCCCG CCGGGTTGGC GGCGCCTGGA CCGCGCCGGT GATCGACGCC
GACCCGGACG CCGAGCAGCC CTACCTGGTC ACCGGTTACG TCCCGGGACC GTCCCTGCTG
GAGGCGGTGC GCCGGCGCGG CCCGCTTCCG GTACCGACCG TGCGGGCGCT GGGTGCCGGC
CTCGCCGAGG CGCTGAGCGC CGTGCACGCC GCCGGCCTGG TGCACCGCGA TCTCAAGCCC
TCGAACGTGC TGCTCGGCCT GGACGGGCCA CGGGTCATCG ACTTCGGGAT CTCCCGCGCG
TTCGACGCCA CCGTGCTCAC CCATTCCGGG TCCGCGATCG GCTCGCCGGC CTTCATGTCA
CCCGAGCAGA TCGGCGGGCA GGAGGTGGGG CCGGCCAGCG ACGTCTTCGC CCTCGGCTCG
GTGCTCGCCT TCGCCGCGAC GGGATCAGGC CCGTTCTCGG GCAGCGGGAT GCCCGCGGTG
ATGTACGGCA TCCTGGCGGG CGAGCCGCGG CTCGACGCCG TCCCGGCCGA GCTGCGCGGC
ATCGTGGACG CCTGCCTGCG CAAGGCGCCG GGTGAGCGCC CGGGGCCGCT CGACGTGCTC
GCGGAGCTCG TCCCGGGCGG CGGCGCCGCC GAACTGATCA CCGCTGGATG GCTGCCGCAG
GACCTGGTCA CCGGGCTGAG CCGGCAGGCC GTCGCGCTCC TCGACCTGGA CGTCCCCGCC
ACCACCACCG TGACGAACGA CCACGCCACA CCGGAAGATC ACAACCGGGC ACTGCCATGG
CCGCCGGAGC CGGCCGCACC GCCAGCCACA GCAGCCACAG CGGCCACAGC GGCCGCACCG
CTAGCCACAG CGGCCGCACC ATCGGGCGTA TCGCGGGCCG CACCACCGGG CACACCGGCC
GCACCATCGG GCACCCCGGT CGGGCCCGGC GCCCCGCCGG CCGGTGGGCG GCGCGCCCTC
GTGACCGTGG CCGCCCTGGG TGTCGTGTGC CTCGCCGTGA TCGTGACGGC CGTCCTGCTG
GTTGCCCTGC GCGGCTCGGA CGGCGGCTCG GACGGCGGCG GCGCGGACGC CGGCGTGAGC
CCCGGGGCCA CCGCCGGGAT CGGCTCCCTG ACCGACCTCC TCGACCAACC GACCTCGAGG
CCGACCGTCG GCGGGCCGCC GGCGAGCTCG GGTTCGGCGC CCTCGGGCGG GTCGGGTCCC
ACGGTGCCGG GGGCCCTGCC GGCCGGGTAC GCCGGCACCT GGGAGGGAAG CATCACCTCG
CGGCTGGGGG TCGTGCAGGA CGTCGTGATC ACGCTGCGGC CCGGCGAGAG TGGTCAGACG
GTCGGCCACT CCGAGGTCAC CCTGGTCGGG CTGGGGGCGT TGGGAGGTGA CGCGTCGATC
CGGTGCGTCG GTGACCAGCA GCTCGTGGGC ATCAGCACCG CGGCGGGCTC CAGGCCCGAG
GTGGTCCTGC GCGACATCGG GGGCGCGGGC GACAACCCCA CCCTGCTGGG TCTGCCGGTG
TGCACGAGCG GCGGCACGAC GAGGCTGCGC CTCGCGGCGG ACGGCGCCCT CGACTACCAG
TCCGAGGACG AGGCCGGCGG GCGCCCGGCG GGAAGCCTGC GCCACCGCCC CTGA
 
Protein sequence
MEPLRADDPR TTGGYRLLGL LGAGGMGRVY LARGPGGRTV AVKVIRPEFA GDPTFRARFR 
REVEAARRVG GAWTAPVIDA DPDAEQPYLV TGYVPGPSLL EAVRRRGPLP VPTVRALGAG
LAEALSAVHA AGLVHRDLKP SNVLLGLDGP RVIDFGISRA FDATVLTHSG SAIGSPAFMS
PEQIGGQEVG PASDVFALGS VLAFAATGSG PFSGSGMPAV MYGILAGEPR LDAVPAELRG
IVDACLRKAP GERPGPLDVL AELVPGGGAA ELITAGWLPQ DLVTGLSRQA VALLDLDVPA
TTTVTNDHAT PEDHNRALPW PPEPAAPPAT AATAATAAAP LATAAAPSGV SRAAPPGTPA
APSGTPVGPG APPAGGRRAL VTVAALGVVC LAVIVTAVLL VALRGSDGGS DGGGADAGVS
PGATAGIGSL TDLLDQPTSR PTVGGPPASS GSAPSGGSGP TVPGALPAGY AGTWEGSITS
RLGVVQDVVI TLRPGESGQT VGHSEVTLVG LGALGGDASI RCVGDQQLVG ISTAAGSRPE
VVLRDIGGAG DNPTLLGLPV CTSGGTTRLR LAADGALDYQ SEDEAGGRPA GSLRHRP