Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0308 |
Symbol | |
ID | 5668732 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 364424 |
End bp | 366553 |
Gene Length | 2130 bp |
Protein Length | 709 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641239239 |
Product | dTMP kinase |
Protein accession | YP_001504680 |
Protein GI | 158312172 |
COG category | [F] Nucleotide transport and metabolism |
COG ID | [COG0125] Thymidylate kinase |
TIGRFAM ID | [TIGR00041] thymidylate kinase |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.233817 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCCCTCGC CACCGCGTGC GCGTACCACC GGCGGCTCGG ACGGCGACCT GCGGGCGGTT CTGCGGATTC CCGAGTTCCG CCGGATGTGG ATCCAGCTCA GCCTTTCCAG CCTGGGCGAC TGGATGGGCC TGCTGGCGAC CACCGCCCTG GTCACCCAGC TGACCGAGAG CTTCTCCGGA CAGGCGTTCG CGATCAGCTC CCTGCTGATC GTCCGTCTGC TGCCCGCGCT CGTGCTGGGC CCGCTCGCCG GAGCGATCGC CGACCGCCTC GACCGCCGGA TGACCATGGT CATCACCGAC GTGATGCGGT TCGGGCTGTT CCTGTCCATC CCGATCGTGG GAACCCTCGG ATGGCTACTG ATCGCGTCCT TCCTCGTCGA GTGCGTGAGC CTGGTGTGGG CACCGGCCAA GGAGGCCTCG ATCCCGCACC TGGTCCCGCG TAACCGGCTC GCGGCGGCCA ACACCCTCAG CCTGATAACG ACCTACGGCA CGGCGCCGCT GGCCGCGGCG ATCTTCACGC TGCTCGCCAC CGTGTCCCGG GCCCTGGGCC CGTCCATTCA CTTCTTCCGC GACTCCTCTC TGGACATCGC GCTGTACTTC AACGCGGCCA CGTTCCTCGG GTCGGCGATC GTCATCTGGG GCCTGCGGTC GATCGGCCGC GCGGAACGGC CCGAGCACAG CACGGAGCCG GGCTTCTTCG CCTCGATCAC CGAGGGCTGG AAGTTCGTCG GCCAGGACCG CCTCGTCCGC GGGCTGGTCG TCGGAATCCT CGGCGGCTTC GCCGGCGCCG GCTGTGTGGT GGCGCTCGGC AAGCTCTACG TCGAGATCCT CGGCGGTGGC GACTCGGCCT ACGGCGTGCT GTTCGGCGCG GTGTTCATCG GCCTCGCCGC CGGAATGGGC GCCGGCCCGA AGCTGCTCGG GGATTACAGC CGCACCCGCC TTTTCGGCGT CTGTGTCACC GCCGCGGGAA TCACGCTGGT GGTCGTCGCG ATCATCCCGA ATCTGGTGAT CGCCTGCATT CTGGTCGTGT TCGTCGGCGC CCTCGCCGGC GTCGCCTGGG TGACCGGGTA CACACTGCTG CAGGCCGAGG TCGCCGATGA GCTGCGCGGG CGCACCTTCG CGCTGGTCCA GTCGCTGGTA CGGGTCGACC TGCTGGTCGT ACTCGCGGCC GCGCCGGCCC TCGTCGGTCT GATCGGTTCG CACCAGGTCC ACCTCTGGGG TGACATCAAC GTCCGGGCCG ACGGTGTCAC CGTCGTCCTG CTCGCCGGCG GGCTGCTCGC CGTCGCGGTC GGCCTGTTCT CCTACCGCCA GATGGACGAC CACACCGGGG AAGCCGTCCT GCCCGAGCTG TGGAACGCGC TGCGCGGGCG GCGGCCGGGC ACTGCCCGCC GCCGTCATGG CGGATTGTTC ATCACCGTCG AGGGTGCCGC GGGCGCCGGC AAGACGACCC AGCTGGAGCT GTTGCGAAGC TGGCTGTCCT CGAGCGGGCG CGAGGTCGTT CCGGCGTACG AGACCGACGG GACGTCCCTC GGCGCGGGCC TGCGGGACCT GCTGGACGAC CCCGCGAACC GGCTGCACGC CCGGACCGAG CTGCTGCTGG ACGCGGCCGA CCGCGCCGAG CACGTGGCCC GGGTCATCGA ACCGGCGCTG GCACGGGGCG CGATCGTGCT CACCGACCGT TACGTGGACT CCGCGATCGC GTGCCAGGCG CTCGGCCAGC GGATCGACAG CGACGAGCTG ACGGTGCTGA CCCAGTGGGC CAGCCACTCG TTGCTGCCCG ACGTGACGAT CCTGCTGGAC CTGCCTGCGG AGCAGGCGCT GGCCCGAGGT GGGAGTGATC CTCACCTCGG GCGGGAGAGC GAGGGCGACG GCAGCGGACA CGATGGCGGT GCCGGGCGCG GCGGCGGTGC CGGGCACGGC GCGGGGCCGG ATCGCTCCGG CACCGACCCG GTCGCGGCCG TGGCCTACCA CCGACGGGTC CGCGAGCACT TCCGCAGGCT CGCGGACGAG GATCCGGACC GGCACGTCAT CCTGGACGCG ACGCTCTCCC CGCAGGAGCT GCACCGGCAG ATCCGTGCGG TGATCGCCGG ACGCCTGCGC CCGGCGATCA CCGGTGACGA CCCGCGATGA
|
Protein sequence | MPSPPRARTT GGSDGDLRAV LRIPEFRRMW IQLSLSSLGD WMGLLATTAL VTQLTESFSG QAFAISSLLI VRLLPALVLG PLAGAIADRL DRRMTMVITD VMRFGLFLSI PIVGTLGWLL IASFLVECVS LVWAPAKEAS IPHLVPRNRL AAANTLSLIT TYGTAPLAAA IFTLLATVSR ALGPSIHFFR DSSLDIALYF NAATFLGSAI VIWGLRSIGR AERPEHSTEP GFFASITEGW KFVGQDRLVR GLVVGILGGF AGAGCVVALG KLYVEILGGG DSAYGVLFGA VFIGLAAGMG AGPKLLGDYS RTRLFGVCVT AAGITLVVVA IIPNLVIACI LVVFVGALAG VAWVTGYTLL QAEVADELRG RTFALVQSLV RVDLLVVLAA APALVGLIGS HQVHLWGDIN VRADGVTVVL LAGGLLAVAV GLFSYRQMDD HTGEAVLPEL WNALRGRRPG TARRRHGGLF ITVEGAAGAG KTTQLELLRS WLSSSGREVV PAYETDGTSL GAGLRDLLDD PANRLHARTE LLLDAADRAE HVARVIEPAL ARGAIVLTDR YVDSAIACQA LGQRIDSDEL TVLTQWASHS LLPDVTILLD LPAEQALARG GSDPHLGRES EGDGSGHDGG AGRGGGAGHG AGPDRSGTDP VAAVAYHRRV REHFRRLADE DPDRHVILDA TLSPQELHRQ IRAVIAGRLR PAITGDDPR
|
| |