Gene Francci3_3472 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagFrancci3_3472 
Symbol 
ID3905206 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameFrankia sp. CcI3 
KingdomBacteria 
Replicon accessionNC_007777 
Strand
Start bp4139278 
End bp4141125 
Gene Length1848 bp 
Protein Length615 aa 
Translation table11 
GC content72% 
IMG OID637880794 
Productprotein-tyrosine kinase 
Protein accessionYP_482554 
Protein GI86742154 
COG category[D] Cell cycle control, cell division, chromosome partitioning
[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0489] ATPases involved in chromosome partitioning
[COG3944] Capsular polysaccharide biosynthesis protein 
TIGRFAM ID[TIGR01007] capsular exopolysaccharide family 


Plasmid Coverage information

Num covering plasmid clones17 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGAGCTGC GCGACTACGT CCGTGTTCTG CGCCGAAGCT GGACGATCAT GATCGCCGGC 
GTGGTCCTCG GCGGGCTGCT GGCCGCCATG GCGACCTGGC GAACGACGAA AGAGTACGCC
GCCTCGGTGA CGATGGTCGT CTCCTCGCCG GACAGGGGCG CCGGAGCCGC CTCGGCCTAC
CAGGGAGGCC TGCTCTCCCA GCAACGCGTC AAGTCCTATG CCGACCTGGT GGCCAGCGAG
CGGGTGGCGA CGGCCGTGAT CGCACGCCTG CACCTGCACG CGACTCCGGA GGCCCTGCGG
GCCCAGATCA GCGCGCACGC CGTCCCGGAC ACCGTGCTGC TCCAGGCCGT CGTGCGCGAT
TCCGACCCGA GACGAGCCAT GATCATCGCA GATGCTGTCG GTGAGACGTT CTCCACCACC
ATTGCGAAGA TCGAGACGCC GTCGGCCGAC GAACCACCCT CCGTGCGGGT GACTGTCTGG
GAACACGCGA AGCTGCCCGT CTCGCCCGTC TCGCCCCAGC CGATCCGCAA CCTCGCGCTC
GGAGCGCTGC TCGGGCTGAT CGTCGGCAGC GCCGCCGGGA TCGTCCGATA CCGCCTCGAC
ACGAGCGTGT CCAGTGAGGA CGACGCCCGT GAGACGACCG AGCTGCCCAA CCTCGCCGTC
ATCGGCTACG ACGGTGCCGC CGACCGGCAT CCCCTCATCA TCAACGCCAA GCCCCGCTCG
GCCCGGGCGG AGGCGTTCCG CCAGCTGCGG ACCAACCTGC AGTTCGTCGA GGTGGACACC
GGACCTCGAT CGATCCTGGT GAGCAGTGCG GTCCCCGGCG AGGGCAAGAC CACCGTGGCC
TGCAACCTCG CGATCACCCT GGCGCAGGGC GGTGCGCGGG TCTGCCTGAT CGAGGGGGAC
CTGCGACGGC CGTCGTTCGG CGAGTACCTC GGGGTCGAGT CCGCCGCCGG GCTCACCTCG
GTCCTCATCG GCGCCGCGGA CCTCGACGAC GTCCTGCAGC CCTGGGGGGA GGGTCGGGTC
GGGGAGGGGC GCGTAGAGGT CCTCGCCAGC GGCCCCATCC CGCCGAATCC CAGCGAGCTG
CTCGGCTCCA AGGGCATGGC CGGACTCATC AACCTGCTGT CCGCCCGCTT CGACATCCTC
CTGGTCGACG CCCCGCCCCT GCTACCGGTG ACTGACGCGG CCGTGCTCGC CACCCGGGTC
GAGGGGGTTC TGCTGGTGAC GCGGGCCGGG CGGACCCGCC GCGAGCACCT CCGGCGGGCC
GTCGAGGCGC TGCGGGCGGT CGACGCCCGG ATGATCGGCA CCGTGCTGAA CATGGTGCCG
GTCAAGGGAC CCGACGCCTA CGACTACGGC CCGGGCGACG GTTACGTGTC ACGCGGCCGG
CACGCCAGGA CCTCCTCGCC CCGAACCCTG GAGATCCCGG CGCCCGATTC CGGGGTCCGG
CCTGCTTGGC CGGTCACCTC GCCTCCGTCG CTGCCGTCAC CGGCTCCTTC TTCTCCGGCT
CCTGCTTCTC CGGCTTCGGC CGTGACGCCG GTTCCCGCCG CCACGGCGCG TCTCGGGATA
GCCGACGGTG GCGACGAGTC GAGCTCGGCG CGGGACCCGC TGCCCGACGG CGCCACGACT
GTCCCGAGCG CCACGACTGT CCCGAGCGCC ACGACTGTCC CGAGCGCCAT GACTGTCCCG
AGCGCCACGA CTGTCCCGAG CGGCGGACCG CACGTCAGGG TGCCGGCGGC ACGCGGATCG
GCCGAGGAGA TCATCTCCGT CGGTGGAAGG ACACCGCGCG GTACGGACCC GGCTGGCGGG
ACGACGCCGG ATGACAACGG CGCGACCCGG CCGCCGTCCT CCCCATGA
 
Protein sequence
MELRDYVRVL RRSWTIMIAG VVLGGLLAAM ATWRTTKEYA ASVTMVVSSP DRGAGAASAY 
QGGLLSQQRV KSYADLVASE RVATAVIARL HLHATPEALR AQISAHAVPD TVLLQAVVRD
SDPRRAMIIA DAVGETFSTT IAKIETPSAD EPPSVRVTVW EHAKLPVSPV SPQPIRNLAL
GALLGLIVGS AAGIVRYRLD TSVSSEDDAR ETTELPNLAV IGYDGAADRH PLIINAKPRS
ARAEAFRQLR TNLQFVEVDT GPRSILVSSA VPGEGKTTVA CNLAITLAQG GARVCLIEGD
LRRPSFGEYL GVESAAGLTS VLIGAADLDD VLQPWGEGRV GEGRVEVLAS GPIPPNPSEL
LGSKGMAGLI NLLSARFDIL LVDAPPLLPV TDAAVLATRV EGVLLVTRAG RTRREHLRRA
VEALRAVDAR MIGTVLNMVP VKGPDAYDYG PGDGYVSRGR HARTSSPRTL EIPAPDSGVR
PAWPVTSPPS LPSPAPSSPA PASPASAVTP VPAATARLGI ADGGDESSSA RDPLPDGATT
VPSATTVPSA TTVPSAMTVP SATTVPSGGP HVRVPAARGS AEEIISVGGR TPRGTDPAGG
TTPDDNGATR PPSSP