Gene Franean1_6100 details


Gene Information

Locus tag: Franean1_6100
Symbol:
ID: 5674421
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. EAN1pec
Kingdom: Bacteria
Replicon accession: NC_009921
Strand:
Start bp: 7426215
End bp: 7428089
Gene Length: 1875 bp
Protein Length: 624 aa
Translation table: 11
GC content: 78%
IMG OID: 641244952
Product: serine/threonine protein kinase
Protein accession: YP_001510350
Protein GI: 158317842
COG category: [K] Transcription
              [L] Replication, recombination and repair
              [R] General function prediction only
              [T] Signal transduction mechanisms
COG ID: [COG0515] Serine/threonine protein kinase
TIGRFAM ID:
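
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The variable names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields above.
start_bp = 7426215
end_bp = 7428089
gene_length_bp = 1875
protein_length_aa = 624

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span_bp = end_bp - start_bp + 1

# Assumption: the CDS is one uninterrupted reading frame (the record says "Is gene spliced: No").
codon_count = span_bp // 3

# Assumption: the last codon is a stop codon and is not counted in the protein length.
expected_protein_aa = codon_count - 1

print("span matches gene length:", span_bp == gene_length_bp)
print("length is a multiple of 3:", span_bp % 3 == 0)
print("protein length consistent:", expected_protein_aa == protein_length_aa)
```

Under these assumptions all three checks agree with the record: 1875 bp corresponds to 625 codons, that is 624 amino acids plus one stop codon.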


Plasmid Coverage information

Num covering plasmid clones: 14
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0281
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCGGACC GATCGGCACG CCGGACGCGG GGTGCCCCCC TGCAGCCGGG CGATCCGACG 
TTGATCGGGC GCTACGAGGT CGTCGGCCGG CTGGGCGCCG GCGGCATGGG CACCGTCTTC
CTGGCCCTCG ACCAGAGCGG CCGGCACGTA GCGATCAAGG TGATCCGCTC GGATCTCGCG
GTGGATCCGG AGTTCCGGGA GCGGTTCGCC GACGAGGTCG CCGCCGCCCG CCGGGTGGCC
CCCTTCTGCA CCGCGCAGGT CCTCGACGCC GACCCGCACG CCCGCCGTCC CTACCTGGTC
ACCGAGTTCA TCGACGGTGC CCGGCTCGAC GAGGTCGTCG CCGAGTCCGG GGCGCTGCCG
CTGTCCACCC TGCAGGGGGT CGCCGTCGGG GTGGCCTCCG CGCTGACCGC GATCCACGGC
GCCGGGATCG TGCACCGTGA CCTCAAGCCC AGCAACGTGC TGCTGTCGTA CTCCGGACCC
CGGGTGATCG ACTTCGGGAT CGCGCGGGCA CTCGACGCGG CGGGCGGGCG CACCCAGTCC
GGGCTGGTCC TGGGCTCGGC CGGGTGGATG GCCCCGGAAC AGATGGAGGG CACGGCGCCG
GTGGGCCCGG CGACCGACGT CTTCGCCTGG GGTCTCCTTG TCGCCTACGC CGCGGGCGGG
AGCCACCCCT ACGGCGACGG CACCTATCTC GAGATGTCGG AGCGGATCCT CACCGGCCAG
CCCGACCTGC GGCCCGTCCC GGCACAGCTG CGCGACGTCG TCGCCGCGGC GCTGCGACGC
GACCAGCGGG TGCGGCCGTC CGCCGAGCAG ATCCTGCTGA ACCTGCTCGG TGACCGTGGC
CGGAGCGGCA ACACGCGCGC CGTGGCCAGT GAGGTCCTCG ACGGCACCTG GCCACGCGGC
CGGTCCGCCG CCGCGGCGGC GGCCGGTTAC GTGGCCGGCG CGGGCGACGC CTTCGGCGCG
GCAGGTGCCG CCGGAGCCGG TGGTGGGCCC GCGGCGACCC GTGTGGCGGG CGGGACGGCC
GGCGCGGGCC AGGCCCCCGG CCGACCGGCC CCGACCCGCG TCGCCGGCCC CGCGGACGTC
AGTAGCCGCT GGTGGGAGGG CCCGCCGGCG CCGGCCTCGG GCCGGCCGGG CGACGGTGCG
GCCCCGGGCG GGTTGCCCGG TGCCCGCCAG GCATCCCCGC CGGTGCCGGG CGGGCAGCAA
CCCGCCCGCC GCCGCCGTAA GGGCGGATTC GTCCAGTACG GGGACGAGCC CGAAGCCGCC
CGACCGGACG GGCGGTACTC GGCAGGCGCG GCCGGCGCGG CTGGAGCCGG CGCGGCGGCG
GCCGGTGCCG GCGCGGCCGG TCCGCCGCAG GCCCAGGGTG GCGGCGCGCG GCCGGCGAGT
GCCAGGCCGG CTCCCCGTGG ATACGCACCC GCCGCGCCAC CCCCGCCCGC GCCTCGCCCG
GCGCCGCGAC CAGCTCCCCG GCCGGCGCCC CCGCCCGCTC CGGCGCCGGC GCCGTACTAC
GGCGACGCAC ACCGCCCGGC ACCCCCCGCG CCGGCACCCC CCGCGCCGGC ACCCCCCGCG
CCGGCGCCGT ACGCGCCGAG CCGCCGGCCG CGCCGGCGGT GGCGCCTGCG GATCCCGTTC
AAGAAGACGA TCATCTTCGT GGCGCTCGTG CTGCTCCTGC TGTCGGCGGC CGACCAGATC
GCCACGATGG TGGACGACCA GCGCCAGCGG CTCTGGGACA GAGTCGTCAG CACATTCCGG
GACGACATCG GCGGCCGGAT CGACGACCTG TGGGGCAAGA CCGACAACCT GCGGAACCAG
ACCCCTGACC TGGGCGACCA GGTTCCCAAC CTGCCCACCC AGCTGCCCGG CGGGCTCGGG
AACCAGAACG GCTGA
 
Protein sequence
MADRSARRTR GAPLQPGDPT LIGRYEVVGR LGAGGMGTVF LALDQSGRHV AIKVIRSDLA 
VDPEFRERFA DEVAAARRVA PFCTAQVLDA DPHARRPYLV TEFIDGARLD EVVAESGALP
LSTLQGVAVG VASALTAIHG AGIVHRDLKP SNVLLSYSGP RVIDFGIARA LDAAGGRTQS
GLVLGSAGWM APEQMEGTAP VGPATDVFAW GLLVAYAAGG SHPYGDGTYL EMSERILTGQ
PDLRPVPAQL RDVVAAALRR DQRVRPSAEQ ILLNLLGDRG RSGNTRAVAS EVLDGTWPRG
RSAAAAAAGY VAGAGDAFGA AGAAGAGGGP AATRVAGGTA GAGQAPGRPA PTRVAGPADV
SSRWWEGPPA PASGRPGDGA APGGLPGARQ ASPPVPGGQQ PARRRRKGGF VQYGDEPEAA
RPDGRYSAGA AGAAGAGAAA AGAGAAGPPQ AQGGGARPAS ARPAPRGYAP AAPPPPAPRP
APRPAPRPAP PPAPAPAPYY GDAHRPAPPA PAPPAPAPPA PAPYAPSRRP RRRWRLRIPF
KKTIIFVALV LLLLSAADQI ATMVDDQRQR LWDRVVSTFR DDIGGRIDDL WGKTDNLRNQ
TPDLGDQVPN LPTQLPGGLG NQNG
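
The sequences above are printed in space-separated blocks of ten, so they need to be joined before any analysis. The sketch below shows one way to do that and to inspect the first and last codons of the gene; the helper names `join_blocks` and `gc_fraction` are hypothetical, and only the first and last lines of the gene sequence are used. The last line can be read in frame because the 31 preceding lines hold 60 bases each, a multiple of three. The gene begins with GTG, which translation table 11 allows as an alternative start codon read as methionine at initiation. That is consistent with the protein sequence beginning with M.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Return the fraction of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First and last lines of the gene sequence, copied from the listing above.
first_line = "GTGGCGGACC GATCGGCACG CCGGACGCGG GGTGCCCCCC TGCAGCCGGG CGATCCGACG"
last_line = "AACCAGAACG GCTGA"

head = join_blocks(first_line)
tail = join_blocks(last_line)

start_codon = head[:3]   # first codon of the CDS
stop_codon = tail[-3:]   # last codon of the CDS

print("start codon:", start_codon)
print("stop codon:", stop_codon)
print("GC fraction of first line:", round(gc_fraction(head), 2))
```

Applying `gc_fraction` to the full joined gene sequence, rather than a single line, would give a value to compare against the GC content field in the record.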