Gene Franean1_1111 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFranean1_1111 
Symbol 
ID5669524 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. EAN1pec 
KingdomBacteria 
Replicon accessionNC_009921 
Strand
Start bp1327123 
End bp1329348 
Gene Length2226 bp 
Protein Length741 aa 
Translation table11 
GC content74% 
IMG OID641240043 
Productserine/threonine protein kinase with PASTA sensor(s) 
Protein accessionYP_001505471 
Protein GI158312963 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.621151 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGCGCAAAC TGGGTGCACG GTACGTCCTG CACGAATTGC TCGGGCAGGG AACGGCCGGC 
CAGGTCTGGC GCGGGGCCCA TGTCCCCGAC GGAGAGCCCG TCGCCATCAA GGTACTTCGG
CCGGAGCTCG CTCATGACCC CGAGATCGTC GACCGGTTCC TGCGCGAGTG GGACCTGCTC
GTCGAGCTCG ACAGCCCTGA CCTGGTGCGG GTTCGTGACC TGATCAGAGA GCCCGGCACG
CCCCTGGCGA TCGTGATGGA CCTCGTCGAG GGCATCGACC TCAGGGCACA CCTCGACCAG
AGCGGGCCAC GGCCGGTGAC CGAGGCGGTC AACCTCGTCG TCGGGCTCCT CTGGGCGCTG
GACAGCGTGC ACGCGGCCGG GATCATCCAC CGCGACGTCA AGCCCGAGAA CGTCCTCATC
GACACCTCGG ACCCGCGCCG CCCCTACGTC CGGCTGACCG ACTTCGGCGT GGCGCGGATG
GTGCACACCC CCACCCGCGC CTCGCTGACC GGCCCGATCG GCACCCCGCT CTACATGGCG
CCGGAGCTGA CGACCGACGC CCCGCCGACG CCGGCCGTCG ACATCTACGC CGCCGGGATC
GTGCTCTACG AGCTCATCGC CGGCAGCCCG CCTTTCGACG AGGCGCACCC GGCCGACATG
GTGCGGGCGC ACCGCGAGGA CCAGCCGCTA CCCATCCAGG GCGTCCCGCC GGCCCTGTGG
GACGTCCTGT CGTCGATGCT CGCGAAGTCG CCGCGCCAGC GGCCCGCCTC CGCCGCGGAC
GCCGCGGAGG ACCTGATCGA GGCGCTCGAG AACGACCGCG ACCGCGACGA CCCGGATTTC
GACTCCGACC AGCTCGACCT CGACAACCGC CGTCCCGACG ACCGGAACCG CGAGTTCGAC
GACCGGTCCG CGGGTCAGCG TGGGCGCGAC TCCGGCACCG GCCGCGGCGC CGTCCCCGAA
CCTCGCCGCG ACGCGCCCGG GAAACGGGAA GCGGCCTTCG ACGCGGCGCA GACCCGGATC
GGCTCGGCCG CCGACTGGGC GGAGGACGAG CGCGGCCGCC AGCCGGCCGC GGCGGGGCGC
CGCCTCGCCG GCGCCGGCGC CGCGGGTGTG GCGACCAGCG TCGCGGGTGC GGGTGCGGGT
GGCCCGGCCG GCGACTGGAA CGACGCCGAA CACACCCAGA TCGCCGGGAT GCCACCCGTC
CGGCCGGACT GGAACGACGC CGAGCACACC CAGATCGCCG GGATGCCACC CGTCCGCGCC
GACTGGAGCG ACGACGACAC CGGTGGCCGG CCGGCGGTCC GCGTCCCGGC CCGCTCCACG
GGCTCCGCGT CCGACCGCAA CACGGTGATC TCCGCGATCC CGGCGAACAA GCAGCCGGCC
CCGGGCAGCT CGGGCTCGTC AGGCGGACCG GGCGGGCGTT CTGCCGCCGA CCGCCGCCGC
CGCAGCCGCA TCGCGGCCGG AGCCGGCCTG GTCGTCGCGC TGGTCGCCGG CGCCGGTGGC
TGGGCACTGG CAGCGGCCGG CGAGAGCGAG AGCGCGCTCA CCGCGGACAG CGGCTCGCAG
GTCGTCACCG ACCCGTCGAT CACCGCGACG CTGCCCGGTG GCGCCCCGAT GCCCCCGGGG
ATGGACCCGG GCACTGTCCC CACAAGCTCC ATAGCAGGAA CGACGCCGCA TCCAAGTACC
TCCCCGAAGC CTGGCCAGAG CGCCACGCCC GGGCCGGCCA CCCCCACCCA GCCGGGTCAG
ACCACGGCCC CGCCGGACGC CAGCGCCGCA CCGACGCCGA CACCGACGAC CAAGGAAGCG
ACGGTCCCCA ACGTGGTGGG CCAGAGCCAG ACGGCGGCCA CCAACACCTT GACGGGCAAG
GGCTTCACGA ACGTCACGGC GACCGAGGTC TGCCAGAAGG GCAAGAACGG CGGCGTCGTG
CTCGACCAGG GCCCGAACGC GGGCAGCGTC GTCCCGGTCA CCACGAAGGT AACGCTCACG
GTCCAGGCAA CGAACTGCGT CGAGGTCCCC GCCGTCGCGA ATCAGACGCT CGCCGCCGCC
CGCAACGTGC TGATCGGCGC CGGTCTGGGT GTCCTGGACG GCAACGGCGG TTGCCAGAAC
GGTCCCGGCA CCACTGCGGC GGGAACGAAC CCCGCCGCCG GGACGATGAT GCGCAAGGGC
GACAGCGTCT GGCTCGAGCC CACCTGCGCG AAGCCGCCCC CCGCCACCGC CGCCGCCGCC
AAGTAG
 
Protein sequence
MRKLGARYVL HELLGQGTAG QVWRGAHVPD GEPVAIKVLR PELAHDPEIV DRFLREWDLL 
VELDSPDLVR VRDLIREPGT PLAIVMDLVE GIDLRAHLDQ SGPRPVTEAV NLVVGLLWAL
DSVHAAGIIH RDVKPENVLI DTSDPRRPYV RLTDFGVARM VHTPTRASLT GPIGTPLYMA
PELTTDAPPT PAVDIYAAGI VLYELIAGSP PFDEAHPADM VRAHREDQPL PIQGVPPALW
DVLSSMLAKS PRQRPASAAD AAEDLIEALE NDRDRDDPDF DSDQLDLDNR RPDDRNREFD
DRSAGQRGRD SGTGRGAVPE PRRDAPGKRE AAFDAAQTRI GSAADWAEDE RGRQPAAAGR
RLAGAGAAGV ATSVAGAGAG GPAGDWNDAE HTQIAGMPPV RPDWNDAEHT QIAGMPPVRA
DWSDDDTGGR PAVRVPARST GSASDRNTVI SAIPANKQPA PGSSGSSGGP GGRSAADRRR
RSRIAAGAGL VVALVAGAGG WALAAAGESE SALTADSGSQ VVTDPSITAT LPGGAPMPPG
MDPGTVPTSS IAGTTPHPST SPKPGQSATP GPATPTQPGQ TTAPPDASAA PTPTPTTKEA
TVPNVVGQSQ TAATNTLTGK GFTNVTATEV CQKGKNGGVV LDQGPNAGSV VPVTTKVTLT
VQATNCVEVP AVANQTLAAA RNVLIGAGLG VLDGNGGCQN GPGTTAAGTN PAAGTMMRKG
DSVWLEPTCA KPPPATAAAA K