Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_0621 |
Symbol | |
ID | 3903489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 704904 |
End bp | 706508 |
Gene Length | 1605 bp |
Protein Length | 534 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 637877954 |
Product | hypothetical protein |
Protein accession | YP_479734 |
Protein GI | 86739334 |
COG category | [G] Carbohydrate transport and metabolism [S] Function unknown |
COG ID | [COG0062] Uncharacterized conserved protein [COG0063] Predicted sugar kinase |
TIGRFAM ID | [TIGR00196] yjeF C-terminal region, hydroxyethylthiazole kinase-related [TIGR00197] yjeF N-terminal region |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.0172288 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.797481 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAATG CCCATACGGT CGAACAGGTT CGAGGTGCCG AGGCGCCGCT GCTCGCCTCG CTGCCCCCCG GCGCTCTGAT GCAGCGGGCG GTGCACGGGC TCGTCGCGCA CGCCGCACGA CGCCTCGGGC GGGTCTACGG TGCCCGCGTC GTCGTGCTCG CCGGCGGTGG CGACAACGGT GGTGACGCCC TCTGGGCCGG AGCGCGGCTG GCCGCCCGAG GGGCGCGGAT TGATGCGGTG GCACCCGGCC GCACCCACCC GGAAGGCACC GCAGCCCTGC TTGCGGCCGG TGGCCGACTG CACCATCCGG GTCCGGTGGT GGGCGCGGGC GGTCCGGCGG GTGCGGGGCA GCCGATCGAC GATGAGCGGC TGCGTATCCT GTTCGCCGCC GCCGATCTGG TGCTGGACGG GTTGCTCGGC ATCGGGGGCC GGGGCGGGCT GCGCGAGCCG TACGCCCGGC TCGCCGTTCT GGCACCCCCC GAACGGACGG TGGCGGTCGA CGTCCCCAGC GGGGTGGACG CGGACACCGG GGCCGTCGCG GGGCCGGCGG TCCGGGCGAG CAGTACCGTC ACCTTCGGTA CCCGCAAACG CGGCCTGTGG CTCATGCCCG GTGCCGCCCA CACCGGCCCC GTCGAGCTGG TCGACATCGG ACTGGACCTG CCTGAACCCG ACCTGTGCGC TCTCGACGAC GCCGACGTCG CCGCGGCATT GCCCGTTCCC AGCCCGACGG CGTCCAAGTA CAGTCGCGGG GTCCTCGGCC TCGTCGCCGG CAGCGACGCC TATCCGGGCG CCGCGGTGCT CGCGGTCGGC GGCGCGCTGC GTGGTGGCGC CGGGTACCTG CGGGTAGTGA CCGCGGGGCA CGCCGAGAAG GTCGCGGGCG AGGCCGTCGG CGCGGTCGTG AGCAGGGCCG GCGATTTCGT GCGGATGGCG CATCCCGAGG CCGTCGTGAC CGTCATCGAG GCCGGTGACG CCGACACGAT GCTGGCCGCC GGCCGGGTCC AGGCCTGGGC GATCGGCCCA GGCCTGGCGC CGGGACCCGC GGTCCGTACC CTGCTGACGG CGTTGCTGGC GACGGACCTT CCGGTCCTGG TCGACGCGGG GGGCCTCGAC CCGCTTGCCG AGATCATCGC GGCGCGGCCC GCCGTCGCCG AGCGGGCGGC GCCGGTGCTG ATCACACCGC ATGAGGGCGA GTTCCAGCGG TTCGTCTCGG TCGCCCTTGG CCGGGACGCG CAGGCCACCG TCGCCGAACT CGCCGACGAT CGGCTCGCTG TTCTCCGGCG GGCCGCGGCG GCGACGGGAG CGGTGATCCT GCTCAAGGGC GCCCGCACAC TCGTCGTGCG GCCGGATGGA TCGGCGCTGG TCAACACGAC CGGATCGGCA TGGCTGGGGA CCGCCGGGAC CGGTGACGTG CTGACCGGCC TGATCGGCTC GCTGCTCGCC GCCGGGCTCG CCCCCGCGAC GGCCGGGGCG GTCGGCGCCT ATCTGCACGG CCGGGCCGCC GAGCGGGCCC CGGCGCCGCT GGCAGCGAAT GATCTGCCCG CCCTCCTGCC CCAGGTCATC GGCGACCTGC TTGACCGGCG ACGGATGCCA CGCGGAGTAG CGTGA
|
Protein sequence | MRNAHTVEQV RGAEAPLLAS LPPGALMQRA VHGLVAHAAR RLGRVYGARV VVLAGGGDNG GDALWAGARL AARGARIDAV APGRTHPEGT AALLAAGGRL HHPGPVVGAG GPAGAGQPID DERLRILFAA ADLVLDGLLG IGGRGGLREP YARLAVLAPP ERTVAVDVPS GVDADTGAVA GPAVRASSTV TFGTRKRGLW LMPGAAHTGP VELVDIGLDL PEPDLCALDD ADVAAALPVP SPTASKYSRG VLGLVAGSDA YPGAAVLAVG GALRGGAGYL RVVTAGHAEK VAGEAVGAVV SRAGDFVRMA HPEAVVTVIE AGDADTMLAA GRVQAWAIGP GLAPGPAVRT LLTALLATDL PVLVDAGGLD PLAEIIAARP AVAERAAPVL ITPHEGEFQR FVSVALGRDA QATVAELADD RLAVLRRAAA ATGAVILLKG ARTLVVRPDG SALVNTTGSA WLGTAGTGDV LTGLIGSLLA AGLAPATAGA VGAYLHGRAA ERAPAPLAAN DLPALLPQVI GDLLDRRRMP RGVA
|
| |