Gene Francci3_1643 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_1643 
Symbol 
ID3905922 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp1975832 
End bp1977436 
Gene Length1605 bp 
Protein Length534 aa 
Translation table11 
GC content74% 
IMG OID637878981 
Productserine/threonine protein kinase 
Protein accessionYP_480748 
Protein GI86740348 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value0.0994433 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGACTGTGG ATGGTGTCGG CATCGGCCCG CTCGTCGCCG GGCGGTACCG GCTCGTCGAA 
CGGATTGGCT CCGGTGGGAT GGGCACCGTG TGGCGGGCCC ACGACGACGT GCTCCGGGTC
GAGGTGGCGA TCAAGGAGAT CCGGGTCTCC GCCGACCTCG ACGACGACGA GCGTGCCGCC
GGAGTGGAGA CGGCGATGCG CGAGGCCCGC AACGCCGCCC GGCTGCGGGG GAACCCACAC
GTGGTCACCG TCCATGACGC CGTGGAACAC GACGGCCTGC CGTGGATCGT GATGGAGCTG
GTCCGGGCCG GCACACTCGC CACGACGGTC AACCGGGACG GTCCCCTGCC GCCCGAGCGT
GCCATCCAGG TCGGGCTCGC GGTCCTCGAC GCCCTGGTCG CGGGTCAGCG GATGGGCGTG
CTGCACCGGG ACGTCAAACC ATCGAACATC CTGCTGGCCG ACGACGGCCG GGTGCTGCTG
ACCGACTTCG GCATCGCCAC CCACGCCGCC GACCCGACCC TCACCGGCGG CATCGGCAGC
GGCGGGACGC CCGCTTACAT GGCGCCCGAA CGCCTCCTCG GCGGCCCCGC GACCCTGGCC
GGCGACCTGT TCGCCCTCGG CGCCACCCTC TACTTTGCCG TCGAGGGGGT CTCACCGTTC
CAGCGCGACA CGCTGCCCAC CACTATCGGC GCCGTGCTGC ACGCCGATCC GCCACCCTTC
CTCCGGGGCG GCAGGCTGTC GGCAGCGATC GCCGGGCTGC TGGCCAAGAA CCCGGCGAGC
CGGCTGCGCG CGGAGGGAGC GCAGGCGTTG CTGACCTGGG CCGCGTCCCA CCCAGCCGAT
TCCGCCCCGG CATCCCTGGT ATCCCCGGCA TCCCTGGTAT CCCCGGTATC TCCGGCATCT
CCGGCATCCC CGGCCGCGTC ATTGCCGCCG TCCCCGGCGC CGGACGCGGT GATCCGGCGC
CGGCACGGGT CGCGACCCGT GCCGTTCCCG CCGGCCTGGC CAGGGACGGC GCGGCCCCCC
GGCGCGGTCG GGTGGGTCCG GTCGTGGCGG GCACCCCGGC TGATGGCCGG GGCGATGATC
CTCCTGGTGC TGCTCGCCGC CGTGGCCGCC GGGGCGTACC AGCTGTTCGG CGCGGACGAC
GGCGGCGACG GCGACCGGCG TCGTGCCTCG CCGCCGACGG TCGCCACGCG GGATCCGGCG
GGGGTCGGGT CGCAACCGGG CGCCGAGGCT GGCGAGGCGC TGCCCCCGGG CATGATCGGC
AGTTGGAGCG GATCGGTGAC GCAGGCGTTC GTGCACTTCA ACGCCGAGCT GGTGCTGCGG
GGCGGCCGGA TCGGCGAGGT GATCGGCACG AGTGCCTACC CGGAGAGTGG ATGCGCCGGC
GAGCTCGTGC TGCGGGGGGT GTCCGGCGCC TCGGTCCGGC TCGAGGAACG TCTCACCCGG
GTCGGGGCGT TGTGCTTCGC CGCCACCTGG CTGGACCTCG TGCTCCACGG CGACGGCACG
CTCGACTGCT CGTATCCGGC GACCGAGATC AGTTCGGCGG GGCAGGCGAC CATGCGCCGC
TCGGCGCCCC CCATGTCCCC ATCGTCCCCC GCGCCTCCGG GCTGA
 
Protein sequence
MTVDGVGIGP LVAGRYRLVE RIGSGGMGTV WRAHDDVLRV EVAIKEIRVS ADLDDDERAA 
GVETAMREAR NAARLRGNPH VVTVHDAVEH DGLPWIVMEL VRAGTLATTV NRDGPLPPER
AIQVGLAVLD ALVAGQRMGV LHRDVKPSNI LLADDGRVLL TDFGIATHAA DPTLTGGIGS
GGTPAYMAPE RLLGGPATLA GDLFALGATL YFAVEGVSPF QRDTLPTTIG AVLHADPPPF
LRGGRLSAAI AGLLAKNPAS RLRAEGAQAL LTWAASHPAD SAPASLVSPA SLVSPVSPAS
PASPAASLPP SPAPDAVIRR RHGSRPVPFP PAWPGTARPP GAVGWVRSWR APRLMAGAMI
LLVLLAAVAA GAYQLFGADD GGDGDRRRAS PPTVATRDPA GVGSQPGAEA GEALPPGMIG
SWSGSVTQAF VHFNAELVLR GGRIGEVIGT SAYPESGCAG ELVLRGVSGA SVRLEERLTR
VGALCFAATW LDLVLHGDGT LDCSYPATEI SSAGQATMRR SAPPMSPSSP APPG