Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0040 |
Symbol | |
ID | 5668466 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 47146 |
End bp | 48660 |
Gene Length | 1515 bp |
Protein Length | 504 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641238969 |
Product | hypothetical protein |
Protein accession | YP_001504414 |
Protein GI | 158311906 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 16 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGACGGCCC GGACCGGCCG CTACACCGCT GAGCAGTACG AACGCTGGCA CATCCGCGTC TGCGCGCGCT GCGGGCGGCG GGCGTCCCTA TCGGCGAACT GGTCGGACGG GCCGATCTGC CGCAGCTGCT ACGACCGGGC CGCCCGCACC TACGGTCGCT GCCCCGGTTG CCAGGCCGAA CGGCTCCTTC CCGGCCGCGA CGACGACGGG GCCGCCCTCT GCAGGGACTG CGCCGGCATC ACCCGCGACT TCTTCTGCTC CCGGTGCGGT TTCGAGGGCC TGCTGCTCGG CGGCCGGCTC TGCGAACGCT GCACCCTCAC CGACCAGCTC GCTGCCGCCC TCGACGACGG CACCGGGAAC GTCAGCCCGC CGCTGGTGCC GCTGCTCGAC GCGCTGCGCG TGATGCCGAA GCCGAAGTCC GGCCTGGCCT GGCTGCGCAA CCCCCGAGTC CGTGAACTGC TCGCGGACCT GGCCACGGGG CGCGTCGCGC TGACGCACGA GGCCCTGCAC GCGCTGCCGA ACTGGCGGAC CGTCGCCTAC CTGCGGGACC TGCTGATGGC CTGCGGCGTG CTACCCGCCG TCGACAAGCA GCTGCTGCAC CACGAGACCT GGCTGCACCG CCAGCTGGCC GAGCTCGACG GCCACCCCCA CGCCCGGCTG CTGCGCCAGT TCGGCACGTG GGCGCAGCTG CCGCGGCTTC GCCACCGGGC GGCGGCGCGC CCGCTGACCC CGCACGCCCG CAAGGAGGCG GCCGCGCAGT TTACCCAGGC CCGGCTGTTC CTGGCCTGGC TCGACGAGCG CGACCGGACA CCGGAAACGC TCACCCGGAC CGACGTCGAT GTCTGGCACG CCACCCACCT CGACCACGCG AAACGCTCCC TGCGGACGTT CCTGACCTGG GCGATGGACA GCGGCCATCT GCCCTGCCTC GACCTTCCCC GCCTCCAGAT CGTCCGCGCG GAGCCTCTCA CCCAGCGGCG CCGCCTCGAC CTGGTGAAAT CCGTCCTGAC CAGCGAGACC GGCTCGCCGC CGACCCGCGC CGCGGCCTGC CTGATGCTGC TCTACGCCCA GCCCGCCAGC CGCATCGTGC GCCTCACCGT CGACGACCTC ACCCGCGACG GCGACCAGGT CCTGCTCCGG CTCGGCGACC CGCCCGTCCC GGTTCCCGAC CCGTTCGCCA CGCTCCTGCT GACCGCCGCA ACCCGGCGGG ACAACATGAC CACCGCCACG AACCCGGACA GCCGCTGGCT GTTCCCCGGC CGCCGCGCCG GCCAGCCCCT GCACCCCTGC AGCCTGCTCG ACCAGATCCG CGCCCTCGGC ATCCCGATCC AGGCCGCCCG CACCGCCGCG CTACGCCAGC TCGTCCTGCA AGCCCCCGCC CCGGTCGTCG CCCAGGCCCT CGGCTACCAC CCGATCACCA CCCAGTGGCA CCGCGCCGAC GCCGGCGGCA CCTGGACCCA CTACGCCCCC GGCGATCACG CCGGGCCGAG CCCCACGCCG CCGGTCATCA CGTGA
|
Protein sequence | MTARTGRYTA EQYERWHIRV CARCGRRASL SANWSDGPIC RSCYDRAART YGRCPGCQAE RLLPGRDDDG AALCRDCAGI TRDFFCSRCG FEGLLLGGRL CERCTLTDQL AAALDDGTGN VSPPLVPLLD ALRVMPKPKS GLAWLRNPRV RELLADLATG RVALTHEALH ALPNWRTVAY LRDLLMACGV LPAVDKQLLH HETWLHRQLA ELDGHPHARL LRQFGTWAQL PRLRHRAAAR PLTPHARKEA AAQFTQARLF LAWLDERDRT PETLTRTDVD VWHATHLDHA KRSLRTFLTW AMDSGHLPCL DLPRLQIVRA EPLTQRRRLD LVKSVLTSET GSPPTRAAAC LMLLYAQPAS RIVRLTVDDL TRDGDQVLLR LGDPPVPVPD PFATLLLTAA TRRDNMTTAT NPDSRWLFPG RRAGQPLHPC SLLDQIRALG IPIQAARTAA LRQLVLQAPA PVVAQALGYH PITTQWHRAD AGGTWTHYAP GDHAGPSPTP PVIT
|
| |