Gene Francci3_4306 details


Gene Information

Locus tag: Francci3_4306
Symbol:
ID: 3907274
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Frankia sp. CcI3
Kingdom: Bacteria
Replicon accession: NC_007777
Strand:
Start bp: 5142635
End bp: 5144827
Gene Length: 2193 bp
Protein Length: 730 aa
Translation table: 11
GC content: 71%
IMG OID: 637881633
Product: thymidylate kinase
Protein accession: YP_483381
Protein GI: 86742981
COG category: [F] Nucleotide transport and metabolism
COG ID: [COG0125] Thymidylate kinase
TIGRFAM ID: [TIGR00041] thymidylate kinase
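
The length fields above are consistent with the listed coordinates. The following is a minimal sketch of that arithmetic, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length. The function names are illustrative and not part of the record.

```python
# Coordinates copied from the record above.
# Assumption: positions are 1-based and inclusive.
START_BP = 5142635
END_BP = 5144827


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS, assuming the last codon is a stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length_bp = gene_length(START_BP, END_BP)
print(length_bp)                  # 2193
print(protein_length(length_bp))  # 730
```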


Plasmid Coverage information

Num covering plasmid clones: 19
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 15
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
GTGGCCACCG ACCATGCAGC CGGATCTCGG GACGCCGGGC GGCGCCCACC CTGGCCCGTC 
CGGCGACCGC GGGTCCGCCG CTCGCCGAGC AGGCATCCGG TGACGACGGT TCCCTCGCCG
CCAATGCCGT CCACCATCGC CGCCGGAGGC GACAGCGACA TCCGGGCCGT CCTGCGGATC
CCCGCGTTCC GCCGGATGTG GATTCAGCTC AGCCTGTCCA GCCTGGGTGA CTGGATGGGT
CTGCTGGCAA CCACCGCACT GGTGACCCAG CTCCAGGAGA GCTTCTCCGG GCAGGCGTAC
GCCATCGGCT CGCTGCTGAT CGTCCGGTTG ATGCCGGCCC TGATCCTGGG CCCACTCGCC
GGTGCCCTCG CCGACAGGCT GGACCGCCGG CTGACCATGG TCGTGACGGA CGTCATGCGG
TTCGGGCTGT TCATGTCGAT CCCGATCATC GGGACGCTCA AGTGGCTGCT CATCGCATCG
TTCCTGGTGG AGGCGGTCAC CCTGGTGTGG GCGCCGGCGA AGGACGCGAG CGTCCCGCAC
CTGGTGCCCA AGGAGCGGCT CGCGGCCGCC AACACCCTCA GCCTGATCAT GACCTACGGC
ACGGCCCCGA TCGCCGCGAC GATCTTCACG ATGCTTGCGA CGATCTCACG GACGCTCGGC
TCCAGCGTGT CGTTCTTCCA TGACTCCTCG GTCGACCTCG CGTTGTACTT CAACGCGGCG
ACGTTCCTCG GCTCGGCAAT CGTGATCTGG GGACTGAAGG GCATCGGACG GGCCGCGCGG
CCGGAGACCG GCCCCGAGCC CGGGTTCTTG GCCTCCATCA CCGAGGGATG GCGCTACGTC
GGCCAGGACA AACTGGTCCG CGGACTCGTC GTGGGCATCC TGGGCGGGTT CGCCGGCGCC
GGCTGCGTCA TCGCCCTCGG CCGGCTCTAC GTCGAGATCC TGGGCGGCGG CGACTCCGCG
TACGGGGTGC TCTTCGGCGC GGTGTTCGTC GGCCTCGCCG CCGGCATGGC CGCCGGTCCC
AAGCTGCTCG GCGACTACAG TCGGACCCGC CTGTTCGGCG TCTGCGTGAC CGGGGCGGGT
ATCACCCTGG TGATCGTCGC GATCATCCCG AACCTGGTCA TCGCCTGCAT CCTCGTGGTC
GCGGTGGGGG CGTTCGCCGG CGTGGCCTGG GTGACCGGAT ACACCCTGCT CCAGGCCGAG
GTCTCCGACG AGCTGCGGGG GCGCACCTTC GCCCTGGTGC AGTCGCTGGT CCGCATCGAC
CTGCTGCTCG TGCTCGCCGC CGCGCCCGGC CTGGTTGGCC TCATCGGCTC GCACAGGATC
CATCTGTGGG GCGACATCAA CGTCCGCGCG GACGGCGTCA CCGCGGTCCT GCTCGCCGGC
GGTCTGCTCG CCGTCGCGGT CGGCCTGTTC TCCTATCGGC AGATGGACGA CCCGACCGGA
GCCGCGGTCT GGCCCGAGCT GTGGAACGCG CTGCGCGGCC GGCGGCCGGC GGCCACACGT
CCCCGCCGCA CCGGCCTGTT CATCGCGATC GAGGGCATCG ATGGCGCCGG CAAGAGCACA
CAGGTGGAGC TGCTACGAAC GTGGCTCGCC TCCTCCGGCC GGGAGGTGGT GGTGACCTCC
GAATCCGGCG GCACCGCACT GGGAACGGGC CTGCGAGAAC TGCTCCTCGA CCCGACGGTA
CGGCTGCACC CCCGGACCGA GGCGCTGCTG ACCGCGGCGG ACCGCGCCGA ACACGTCGCG
CAGGTGATCG AACCCGCGCT GGCCCGGGGA GCGATCGTCA TCACCGATCG TTACGTCGAC
TCGTTCATCG CCTTCCAGAG CGTGAGCCAG GCCCTCAGCA CCGACGAGCT GTCCGTCCTC
ACCCAGTGGG CGACGCACGC GCTGCTTCCC GACGTCACGG TCCTGTTGGA TCTGCCGGCC
GAGATCGTCC TGCGCCGCGC GCGGATGACC GCGTATCCTG ATCCGCCGCC GGGGCCGCCG
GGGCCGCCGG ACGGCCCGGC CGAGGCGGAC GACGCCGGGA AGTTGACCTT CCAGGGCCGG
GTACGCGACA CGTTCCGTCG GCTGGCGCAG AAGGAGCCCG ATCGTTACCT TCCGCTAGAC
GCGACGCGGT CACCGGAGGA GATCCACGCC GAGATCCGCT CGTTCATGGC CAGCCGCATC
GATCGGCCAG CCGTCCTGCT CGGTGTGCTC TGA
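
A GC content figure such as the one in the Gene Information block can be recomputed from a sequence printed in this 10-base grouped layout. The sketch below is one way to do it, assuming that whitespace is only formatting and that every remaining character is a base; `gc_percent` is an illustrative name. Only the first line of the gene sequence is used here, so the printed value describes that line alone, not the whole gene.

```python
def gc_percent(sequence: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    bases = "".join(sequence.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in bases if base in "GC")
    return 100.0 * gc_count / len(bases)


# First line of the gene sequence above; paste the full sequence
# to recompute the figure for the whole gene.
first_line = (
    "GTGGCCACCG ACCATGCAGC CGGATCTCGG GACGCCGGGC GGCGCCCACC CTGGCCCGTC"
)
print(round(gc_percent(first_line)))
```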
 
Protein sequence
MATDHAAGSR DAGRRPPWPV RRPRVRRSPS RHPVTTVPSP PMPSTIAAGG DSDIRAVLRI 
PAFRRMWIQL SLSSLGDWMG LLATTALVTQ LQESFSGQAY AIGSLLIVRL MPALILGPLA
GALADRLDRR LTMVVTDVMR FGLFMSIPII GTLKWLLIAS FLVEAVTLVW APAKDASVPH
LVPKERLAAA NTLSLIMTYG TAPIAATIFT MLATISRTLG SSVSFFHDSS VDLALYFNAA
TFLGSAIVIW GLKGIGRAAR PETGPEPGFL ASITEGWRYV GQDKLVRGLV VGILGGFAGA
GCVIALGRLY VEILGGGDSA YGVLFGAVFV GLAAGMAAGP KLLGDYSRTR LFGVCVTGAG
ITLVIVAIIP NLVIACILVV AVGAFAGVAW VTGYTLLQAE VSDELRGRTF ALVQSLVRID
LLLVLAAAPG LVGLIGSHRI HLWGDINVRA DGVTAVLLAG GLLAVAVGLF SYRQMDDPTG
AAVWPELWNA LRGRRPAATR PRRTGLFIAI EGIDGAGKST QVELLRTWLA SSGREVVVTS
ESGGTALGTG LRELLLDPTV RLHPRTEALL TAADRAEHVA QVIEPALARG AIVITDRYVD
SFIAFQSVSQ ALSTDELSVL TQWATHALLP DVTVLLDLPA EIVLRRARMT AYPDPPPGPP
GPPDGPAEAD DAGKLTFQGR VRDTFRRLAQ KEPDRYLPLD ATRSPEEIHA EIRSFMASRI
DRPAVLLGVL
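
The gene sequence begins with GTG while the protein begins with M. That fits translation table 11, where GTG can act as an initiation codon read as methionine. The sketch below illustrates the relationship on the first and last lines of the gene sequence. It assumes the amino-acid assignments of table 11 match the standard code and treats only ATG, GTG and TTG as start codons, which is a simplification. The `translate` helper is illustrative, not the method used to produce this record.

```python
# Codon table in the usual TCAG ordering.
# Assumption: table 11 shares these amino-acid assignments with the standard code.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((a, b, c) for a in BASES for b in BASES for c in BASES), AMINO_ACIDS
    )
}

# Simplified set of initiation codons (an assumption, not the full table 11 list).
START_CODONS = {"ATG", "GTG", "TTG"}


def translate(sequence: str) -> str:
    """Translate a CDS, reading an initial start codon as M and stopping at '*'."""
    bases = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(bases) - len(bases) % 3, 3):
        codon = bases[i:i + 3]
        if i == 0 and codon in START_CODONS:
            residue = "M"
        else:
            residue = CODON_TABLE[codon]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


print(translate("GTGGCCACCG ACCATGCAGC CGGATCTCGG"))         # MATDHAAGSR
print(translate("GATCGGCCAG CCGTCCTGCT CGGTGTGCTC TGA"))     # DRPAVLLGVL
```

Both outputs match the corresponding ends of the protein sequence above. Each full line of the gene sequence holds 60 bases, so every line starts in frame.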