Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_3816 |
Symbol | |
ID | 5672180 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4533893 |
End bp | 4535461 |
Gene Length | 1569 bp |
Protein Length | 522 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242695 |
Product | tail sheath protein |
Protein accession | YP_001508115 |
Protein GI | 158315607 |
COG category | [R] General function prediction only |
COG ID | [COG3497] Phage tail sheath protein FI |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.414451 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.268079 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAACT ACCGGTCGCC CGGCGTCTAC GTCGAGGAGG TCGAGGCGGG GTCCCGCCCG ATCGAGGGCG TCGGCACCGC GGTCGCCGCG TTCGTCGGCT TCACCGCGCG CGGTCCTCGC AACGACGCGG TAAGGATCGC CAACTGGGGC CAGTACGTGC AGGTCTTCGG CGACTTCGTA CCCGGCGCCT ACCTGCCACT GGCGGTCTTC CAGTACTTCA ACAACGGCGG CGGGGCCTGC TACGTGGTCA GCGTGGGCTC CGGCGGCGAC GGGGCGACCG CGGGCCCGGT GGCCGAGCTC ACCAGCGCCA CCCGTCCCGG CGTGGGTGTC TACCAGGTGG AGGCGATCGA CCCCGGGGCG CGCGGCGGCC TGGCGGTCGA GATCGCGCCG ATCGATCGGG CCGGCGGCGA GGCGGCGAAC GGGGGGGACG GTGACGGCGC GGCGGCAGAC GTCGGCCGCG CCTTCACCGT GATCGTCACG CAGGACGGCC GGCAGGTGGA GACGTTCGAG AACCTCACCG CCCGCCGCGG GAAGCGCAAT GTCGCCACCG TCATCAACGA GCAGTCGCAG CTGGTCCGGG TGATCGAGTT GGACGGGGTG ACCGCGCTCG ACCGTGCCCC CTCGCCCGGC ACGGTCGCGC TGCGCGCGCC GGCGCCGGCG CCGGCGCCGC GGGCGAGCGC CGAGGACTTC ATCGGCGACG TCTCCGCCCG TACCGGGGTG TGGGGGCTCG CTGCCTGCGA CGACGTCACC ATGGTCCTGG CTCCCGACCT GATGTTCTGC TACCAGCAGG GACAGATCGG CCTGGACACG GTGCAGGCCG TGCAGCAGGC GATGATCGAC CACTGCGAGA ACCTGGGCGA CCGGATGGCG ATCCTCGACC CGCCGCCTGG GCTGAACGCA CAGCAGATCA AGGACTGGGT GAAGGAGAAG GCCCGGTACG ACTCCAAGTA CGCCACCCTC TACTGGCCGT GGGTGTCGGT TTTCGACCCG GTGGAGGGCG GGCGCCGGTT CATCCCACCG TCCGGGCCGG TCGCCGGGAT CTGGGGCCGC AGCGACGACA GCCGCGGGGT GCACAAGGCA CCCGCCAACG AGGTGGTCCG CGGCGCCCTC GATCTGGAAA TCAACATCAC GAGGAGCGAG CACGACGCGT TGAACCCCGA GGGGATCAAC GTGATCCGCG CTTTCCCCGG CCGCGGGATC CGGGTCTGGG GCGCACGCAC CCTCTCGTCG GACCCGGCCT GGCGCTACGT CAACGTCCGG CGGCTGTTCA ACTATCTGGA AGCGTCGATC CTCAATGGCA CCCAGTGGGC GGTGTTCGAG CCGAACGACC TCGACCTCTG GCAGCGGCTG CGGCGCACGG TGTCCGCGTT CCTGCTGGGC ATGTGGCGGG ACGGCGCCCT CTTCGGTATC ACCCCGGACC AGGCCTTCTA CGTCAAGTGC GACGAGGAGA CGAACCCACC CGGTGTCGTC GACGCCGGCC AGGTGATCAT CGAGATCGGG GTCGCGCCGG TCAAGCCGGC CGAGTTCGTG GTCTTCCGGA TCGCTCAGAT CACCGCCGGC GCCGAGTAG
|
Protein sequence | MPNYRSPGVY VEEVEAGSRP IEGVGTAVAA FVGFTARGPR NDAVRIANWG QYVQVFGDFV PGAYLPLAVF QYFNNGGGAC YVVSVGSGGD GATAGPVAEL TSATRPGVGV YQVEAIDPGA RGGLAVEIAP IDRAGGEAAN GGDGDGAAAD VGRAFTVIVT QDGRQVETFE NLTARRGKRN VATVINEQSQ LVRVIELDGV TALDRAPSPG TVALRAPAPA PAPRASAEDF IGDVSARTGV WGLAACDDVT MVLAPDLMFC YQQGQIGLDT VQAVQQAMID HCENLGDRMA ILDPPPGLNA QQIKDWVKEK ARYDSKYATL YWPWVSVFDP VEGGRRFIPP SGPVAGIWGR SDDSRGVHKA PANEVVRGAL DLEINITRSE HDALNPEGIN VIRAFPGRGI RVWGARTLSS DPAWRYVNVR RLFNYLEASI LNGTQWAVFE PNDLDLWQRL RRTVSAFLLG MWRDGALFGI TPDQAFYVKC DEETNPPGVV DAGQVIIEIG VAPVKPAEFV VFRIAQITAG AE
|
| |