Gene Francci3_3136 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3136 
Symbol 
ID3903933 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp3708975 
End bp3710693 
Gene Length1719 bp 
Protein Length572 aa 
Translation table11 
GC content75% 
IMG OID637880457 
Productserine/threonine protein kinase 
Protein accessionYP_482222 
Protein GI86741822 
COG category[K] Transcription
[L] Replication, recombination and repair
[R] General function prediction only
[T] Signal transduction mechanisms 
COG ID[COG0515] Serine/threonine protein kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones13 
Plasmid unclonability p-value0.295772 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCGTTCCC GCCTGCGGGA CGCCGGCTCT GGAGAGCTCT GGTCGGGTCA GTTCGACCAG 
TCCGGAATCG AGTGCGTGGT GCGGCGGGTG CGGTTGGCGC CGGACCCGGT GCTGCGTTCC
GCGGCGGTCA TCGCGGCCCG GGCCCTCGTG GACCTGGAGC ATCCCCATCT GGTGCCGGTC
GTGGCGGTGT TGCCGACCGC GGAGGGCCTG GCGTTGATCA CCGAGCCGGT CGTCGGCGGT
GTCAGCCTCG CGCGGCTGCT CGCCGCCCGC GGGGACCTGG ACCCCGGTGA GGTCGTCACG
ATCGGTCTGC CGATCGCGCA GGCCCTGGCC GCCGCGCACG CCGTCGGCGT CGTGCATGGC
CGGCTCGAGC GTGCGGACAT CCTGTTGGAG CCGAACGGTC GTCCGGTTCT GATCGGGCTC
GGGGTCGCGG CGCTGGCCGA CGCCGCCAGA CCGGATCCGA TTCCACCCGA GGCCGCGTCC
GCGGACGTCC ATGACCTGGC CACTCTGCTG CTCGGGGCGA TGCGTGAGGC CACCGGGCCA
GACGCGGCCG CGGTGGCGGT GGCGGTGGCG ACCGCCATGA TCGACGACCC GCGTCGGCGA
CCGAGCGCCC TGGAACTCGC CGCGTCGCTG GCCCGCAGCG CGACTCCGCT CCCGGTCCGC
CTGGCGGGCG GCGGCGAGGT GGGCGGCGGC GAGCCGGGTG GCGCGGCGCG GCGCGGTCCC
TCGCCGACCG ACACCCTGCC GGCGATCCCC TGGGTGGACC CGCCCGACGA TCCGGCCCCG
CCCGACCGGG CCATCACCGG AGCGACGGCC GTCAACGCCG CGGAGCTGCT CGACTCGTTG
CCGCCACCGC CCCGACGATC GACCCGTGCG ACGACCAACC CTCGGAAGGC CGCCCGCGGT
GGCGGTCGGG GCTCCGGGGG CAAGGGGTCG GCCACCCCGG GTGCGGCTGC TCCGGGGCCG
CGGTCACCGC GTTCCGGCGG ATCCACCGGA AGCGGCCAGG GGACGCAGCG AGGTCCGCGC
GGGCACACGA CCGCCTCGGG CTCCCCGGGT CCCGCGAGGC CCCCGGGGGG CCTCGGCTCC
CCGGGCCGGA GCACCCGGCC GACCTCCGCG CGCCGACCGC GTCGGGTGGG AGCCCGCCGC
CAGCGCTTGC TGTTCCCGGT GACCGCCGGG CTGGGCCTGC TGGTGGTCGT GGCGGCCGTC
GCGTTGCTGC TGGCCAGCCC GGAGAAGGAC GATGCACCCG GCGCGGCCGG GACGTCCGTG
CCACCAACCG GTGCAGCCTC CGCGGCCGGG CGGGGACCGA CGGCGTCGCC GGTCGCCAAT
CAGTCGCCCG AGCAGGTCTG GCGTTCGGTA CTCGCCGAGT TGAACACCGC GCGCAGCCGT
GCGTTCGAGC GGGCTGACGA GAGCCTCCTG ACGGAGTCTG ATGCCGCGGG TAGTGAGTTG
CACGCTTCGG ATATCGCGCT GATGAGGCAG GTGGTCGCGA GGGGGGCGCA TGCCTCGGCA
CTGCGCAGCG ACATCCTCGA TCTCAAGGTC CGGGTCGAGG AACCGGACCG CACCGTGCTG
CGGGTCACCC AACGCCTGAA CGCCTACGAC TTCCTCGACG CCGGGGGGAA GGTGCTGGCG
CATCAGGCGG CGAAGTCGCC GGAACGGCGT GACCTCACCC TGATCCGTAC CGGCTCCGGT
TGGCGGGTCT CGGAGAACGT CCCGGTCACG GCCGGCTGA
 
Protein sequence
MRSRLRDAGS GELWSGQFDQ SGIECVVRRV RLAPDPVLRS AAVIAARALV DLEHPHLVPV 
VAVLPTAEGL ALITEPVVGG VSLARLLAAR GDLDPGEVVT IGLPIAQALA AAHAVGVVHG
RLERADILLE PNGRPVLIGL GVAALADAAR PDPIPPEAAS ADVHDLATLL LGAMREATGP
DAAAVAVAVA TAMIDDPRRR PSALELAASL ARSATPLPVR LAGGGEVGGG EPGGAARRGP
SPTDTLPAIP WVDPPDDPAP PDRAITGATA VNAAELLDSL PPPPRRSTRA TTNPRKAARG
GGRGSGGKGS ATPGAAAPGP RSPRSGGSTG SGQGTQRGPR GHTTASGSPG PARPPGGLGS
PGRSTRPTSA RRPRRVGARR QRLLFPVTAG LGLLVVVAAV ALLLASPEKD DAPGAAGTSV
PPTGAASAAG RGPTASPVAN QSPEQVWRSV LAELNTARSR AFERADESLL TESDAAGSEL
HASDIALMRQ VVARGAHASA LRSDILDLKV RVEEPDRTVL RVTQRLNAYD FLDAGGKVLA
HQAAKSPERR DLTLIRTGSG WRVSENVPVT AG