Gene Francci3_3788 details

Gene Information

Locus tag: Francci3_3788
Symbol:
ID: 3906073
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 4539356
End bp: 4541080
Gene Length: 1725 bp
Protein Length: 574 aa
Translation table: 11
GC content: 73%
IMG OID: 637881115
Product: hypothetical protein
Protein accession: YP_482868
Protein GI: 86742468
COG category: [R] General function prediction only
COG ID: [COG0661] Predicted unusual protein kinase
TIGRFAM ID:
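
The listed gene and protein lengths follow from the coordinates. A minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; the function names are illustrative and not part of the database:

```python
# Coordinates copied from the Gene Information fields above.
START_BP = 4539356
END_BP = 4541080


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))                  # 1725
print(protein_length(gene_length(START_BP, END_BP)))  # 574
```

Both results agree with the Gene Length and Protein Length fields, which is consistent with the inclusive-coordinate assumption.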


Plasmid Coverage information

Num covering plasmid clones: 15
Plasmid unclonability p-value: 0.75918
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.724918
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGTCCGAGA TCCCACGGCG GGCCGTCGTC CGCACCGCCA AGCTGGCCAC GCTTCCCATC 
GGAATAGCCG GCCGAGCCAC CCTCGGCGTC GGCAAACGCA TCGGTGGTCG CCCCGCCGAG
GCCGTCGCCT CCGAGCTCCA GCAGCGCACC GCCGCGCAGA TCTTCCGGGT GCTCGGGGAG
CTCAAGGGTG GCGCCATGAA ACTGGGCCAG GCGCTGTCCG TCTTCGAGGC GGTGCTTCCC
GACGACGTCG CCGGGCCATA CCGGGCGGCG CTGACCAGGC TGCAGGAGGC GGCGCCCCCG
CTGCCGGCGG CAGTGGTGCA CCGCGTGCTC GCCGAGGAGC TCGGAGCGGA CTGGCGCTCG
CTGTTCACGA GCTTCGACGA CGTTCCGGCG GCGGCGGCCA GCATCGGCCA GGTGCATCGG
GCGGTCTGGG CTGACGGCCG AGCCGTCGCG GTGAAGGTGC AGTATCCGGG AGCGGGTCCC
GCTCTGCTCG CCGATCTCAC CCAGCTCGGC CGGGCCGCTC GGCTGTTCGG CGCGGTCACT
CCCGGATTGG ACATCAAGCC GCTGGTCGAG GAGCTCAAGG CCCGAATCGC CGAGGAGCTC
GACTATCGGT TGGAGGCCGC CTGGCAGGGG GCCTTCGCCG AGGCGTACGC GGACGAGCCG
GACGTCGTCA TCCCCCGGCC GCTGGCCGGT TCTGGCCGGG TGCTCGTGAG CGAGTGGATC
GAGGGAATAC CCCTGTCCGT CATCATCGCG GACGGTACCC CGCAGCAGCG CGACACCGCC
GGCCTGCTGC TCGTGCGGTT TCTCTATTCC TGCCCGGGTC GCGCCGGTCT GCTGCACGCT
GATCCGCATC CCGGCAACTT CCGCCTGCTG TCCGACGGGC GGCTCGGCGT CCTGGACTTC
GGCGCCGTGA ACCGCCTGCC GGACGGCCTG CCGGCGCCGA TCGGCCGGCT GGCTCGACAG
ACCCTGGCCG GGGACGCCGA CGCCGTCGAG CAGGGACTGC GCCGCGAGGG CTTCATCCCG
CCGTCGGCCG AAATCCGAGC CGAGGACCTG CTGGATTATC TGGCCCCGAT GCTGGAGCCG
ATCGCGGTGG AGGAGTTCAC CTTCTCCCGA GGCTGGCTGC GCAAGGAGGC CGCCAGGCTC
GGAGACTGGC GGTCGGCGGC GGCACAGCTC GGTCGCCAGC TCAATCTGCC GCCGTCGTAC
CTGCTCATCC ACCGGGTGAC ACTCGGCGCG ATCGGCATCC TGTGCCAGCT GGGCAGCACC
GGCCGCTTCC GGGACGAGAT GGAGCGCTGG CAACCCGGGT TCGCCGAACC GGGGACGGCG
ACCGCCCGCG CGGCCGAGGA CGCCAACCGG CCCGGTCGTC CCCTGCCCGC TCTCCCCGTC
CAGGACGAGG CCGGAATCGT CCGGCCGCTG GACGGCCCGG TCGTTCTGGC CGGCGGGTTG
CCCGGCCCGC GCAAACCCCG CAGACCAGGA CGGACCGGCA GGACGACCAA GGCCCGCAAG
GCCGGGAGGT CGGCGGCCGA CACCGCTCCG GCGACTGCCG ACGACCGGGC CACCGCCCCC
GCGGCGCTGC CCCTACAGGC CGAGCCGGCC GAACCGAGCC GGCCCTCTAC CCCGGCGAAC
CGCTCCGCCG CGGCTCGACC CGGGTCGGGC CGCAAGGCCG CCCCGCGCCG GGCACCGGAC
GAGGAGTCGT TCTCCGACCC CGCAACCGTG ACCGACGCTT ATTGA
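
The GC content field can be recomputed from the gene sequence. A minimal sketch, assuming GC content means the percentage of G and C bases among all A, C, G and T bases; the helper name `gc_percent` is hypothetical, and only the first line of the sequence is pasted here for brevity:

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C among the A/C/G/T characters of a sequence.

    Spaces between the 10-base blocks and line breaks are ignored.
    """
    bases = [b for b in sequence.upper() if b in "ACGT"]
    if not bases:
        raise ValueError("no nucleotides found")
    gc_count = sum(b in "GC" for b in bases)
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence only. Paste the full sequence to
# compare the result against the GC content field.
first_line = "GTGTCCGAGA TCCCACGGCG GGCCGTCGTC CGCACCGCCA AGCTGGCCAC GCTTCCCATC"
print(round(gc_percent(first_line), 1))
```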
 
Protein sequence
MSEIPRRAVV RTAKLATLPI GIAGRATLGV GKRIGGRPAE AVASELQQRT AAQIFRVLGE 
LKGGAMKLGQ ALSVFEAVLP DDVAGPYRAA LTRLQEAAPP LPAAVVHRVL AEELGADWRS
LFTSFDDVPA AAASIGQVHR AVWADGRAVA VKVQYPGAGP ALLADLTQLG RAARLFGAVT
PGLDIKPLVE ELKARIAEEL DYRLEAAWQG AFAEAYADEP DVVIPRPLAG SGRVLVSEWI
EGIPLSVIIA DGTPQQRDTA GLLLVRFLYS CPGRAGLLHA DPHPGNFRLL SDGRLGVLDF
GAVNRLPDGL PAPIGRLARQ TLAGDADAVE QGLRREGFIP PSAEIRAEDL LDYLAPMLEP
IAVEEFTFSR GWLRKEAARL GDWRSAAAQL GRQLNLPPSY LLIHRVTLGA IGILCQLGST
GRFRDEMERW QPGFAEPGTA TARAAEDANR PGRPLPALPV QDEAGIVRPL DGPVVLAGGL
PGPRKPRRPG RTGRTTKARK AGRSAADTAP ATADDRATAP AALPLQAEPA EPSRPSTPAN
RSAAARPGSG RKAAPRRAPD EESFSDPATV TDAY
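
The protein sequence can be checked against the gene sequence by translating it. A minimal sketch, assuming that translation table 11 shares the codon-to-residue assignments of the standard code and that the first codon (here GTG) is an initiator read as M; the function name `translate_cds` is hypothetical:

```python
BASES = "TCAG"
# Residues for all 64 codons in TCAG order; "*" marks a stop codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_cds(dna: str) -> str:
    """Translate a CDS, ignoring whitespace and stopping at the first stop codon.

    Assumption: the first codon is a start codon and is always read as M.
    """
    seq = "".join(dna.split()).upper()
    if len(seq) % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    residues = []
    for pos in range(0, len(seq), 3):
        residue = CODON_TABLE[seq[pos:pos + 3]]
        if residue == "*":
            break
        residues.append("M" if pos == 0 else residue)
    return "".join(residues)


# First three 10-base blocks of the gene sequence above.
print(translate_cds("GTGTCCGAGA TCCCACGGCG GGCCGTCGTC"))  # MSEIPRRAVV
```

The output matches the first ten residues of the protein sequence. Forcing the first residue to M is a simplification that is only valid when the input begins at the annotated start codon.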