Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_2503 |
Symbol | |
ID | 5670899 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 2979523 |
End bp | 2981442 |
Gene Length | 1920 bp |
Protein Length | 639 aa |
Translation table | 11 |
GC content | 79% |
IMG OID | 641241420 |
Product | hypothetical protein |
Protein accession | YP_001506841 |
Protein GI | 158314333 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.116558 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGGTCG GCTCCGGGCG TTCCGCGACG GCGCGGCTCG CCGCCTCGAC GGCGGCCACG ACGCTGCTCG TGGCCGCGGC GACCGTTGCC GCCGGGTGGC CGTACTGGGA GCGCCACCGG CTGGCCACCG TCCCGAACGT GGGCGTGGCG GTGGTGCTGG TCGTCGCCGG CATGCTGCTG CGCGGCGCCC CGTCGTCACG CCGGTGTGGG GCGCTGCTGG CCCTCGGCGG CCTGTGCTGC CCGCTGACCT GGCTGATGAG CTGGGACGTC GGCCCGCTGC CGCTGATCGC CGTCCTAGGC CAGTCCGCCT GCTGGCTGGC CTACGCCTGG GGGCTGCTGC TCTACCCGGG GAACACCCTG CGCGGCCGGG CCGACCGGTG GTGGGTCACG TCCGCGGCGG TGATCCTGCT GGGCGGGCAG GTGCTGACGG TCGTCATCTC CCGGCCGTCC TGGAACGGCT TCACCGACGG TGTGCTGTGG CCCGCGCCGT TGCTCGTCGG CCGGGCCCGT TTCGAGGCGG CGTTGGACGT CCTGCTCGTC ATCTACGTCC TGGTGCCCGT GACGTTCGTG CTGCTGGCGG TCGCCCGGAT CCGGGCGGCG CGGGGCCTGG AGCGGGTCGT CGCCGGTCCG GTGCTGCTCT CGGCGGGCTT CGTGGCGCTC GTCGCCGCGG TGACCTACCC GGGCCTGATG GCCGAACCCG AGCTGGGCCG CATCGAGGAC GCGGTCGCTC TGCAGGGGGC CGCGTCGATC GTCGCCCCCG TCGTGCTGCT CGCCGTCGGC GCCCGCCGCC GGCTGCTGCT GGCGACGACC GCCGACCGGT TCGGCCGGGA GATCGGGTCG CCGACACCGG CGTCGGTGCG GGCGGCGCTG CGGGCCCTGC TGCGCGACGC CACCCTCGAG GTCTACTACC GCCAGCCGGG CACATGCCGC CAGACGGACA CGGAGGTGTT GGTGGATCTG CACGGCCAGG TGGTGGGCCT CCCCCCGGAG GACCACCGGG GGGAACGGTC GGACCGGCCG GGCCCGGAAC AGCGTTGGTA CGTCCCGGTG CGGTCCGCGG CCGGCGCGCC GGTCGCGGTG CTGAGCATCG ATCCCGCCCT GCGCCGCCAC CGGCGGCAGG TGACGGCGGC GCTGGCCGCG GCGGCCGCGG CGCTGGAGCA GGCGCGGGCG CAGAGCGGCC TGCGCGCGCA GCTCGTCCGC CTGGGTGAGG AACGCCGGCG GGCCGCGCGC ACGCAGGCCC ACGAGTGGGC GCTGGTCGGG CGGGAGCTGG ACGACGGCGT CCGCCGGCGC CTGGCCGAGC TCGCCGCGGC GGCGGGGGAC GTCGCCCGGA CGGTGTCGGA ACCGGCCACC GCGCGGGCCC TGGCGGAGAT CGGCGAGGGG CTGCGCGCGG CGCACGGCGA GCTCGCCGGC ATCGCCCGGG AGGCCCATCC CGCCGTCCTG GAGCGGGACG GCCTGCTCCC GGCGTTGGAG AGTCTCGCCG CGGGGCTGGG GCTGGGCGGC CCGAGCCTGC TGCGGGTGCC CGCCGGCCGC TTCGACGCGA CGGCCGAGCG GGCGATGTAC GCGGCGCTCG CCGCCGCGCT GCGCGCCATC GCGGCCGCCG CCGCGGCTCC ACCGCCGGAC ACGGCCGCGG GGATGCCGGA GCCGGCCACG GCGGGGCCGG TGGTGTCAGA GCTGGCGGTG GGGCCGGTGG CGGAACCGGC CGCGGTGGGG CCGACGGCGG GCAGGAGAGC GTGGGGGGCC GCCCGGGTCG AGGTGCGCGC CGAGGGCGCG ATGCTCGTCG GCGAGGTCAC CTGCGCGGTG CCGGTGGCCG GCGGGGTGCG CGCCGCCGCC GACCACGCCC GGGCGCTGGG CGGTTGGGTC GCCGTACGCG GTGTCGCCGG CGGCGCCACC ACGACCCGGG TGACGGTCCC GTGCGGGTAG
|
Protein sequence | MPVGSGRSAT ARLAASTAAT TLLVAAATVA AGWPYWERHR LATVPNVGVA VVLVVAGMLL RGAPSSRRCG ALLALGGLCC PLTWLMSWDV GPLPLIAVLG QSACWLAYAW GLLLYPGNTL RGRADRWWVT SAAVILLGGQ VLTVVISRPS WNGFTDGVLW PAPLLVGRAR FEAALDVLLV IYVLVPVTFV LLAVARIRAA RGLERVVAGP VLLSAGFVAL VAAVTYPGLM AEPELGRIED AVALQGAASI VAPVVLLAVG ARRRLLLATT ADRFGREIGS PTPASVRAAL RALLRDATLE VYYRQPGTCR QTDTEVLVDL HGQVVGLPPE DHRGERSDRP GPEQRWYVPV RSAAGAPVAV LSIDPALRRH RRQVTAALAA AAAALEQARA QSGLRAQLVR LGEERRRAAR TQAHEWALVG RELDDGVRRR LAELAAAAGD VARTVSEPAT ARALAEIGEG LRAAHGELAG IAREAHPAVL ERDGLLPALE SLAAGLGLGG PSLLRVPAGR FDATAERAMY AALAAALRAI AAAAAAPPPD TAAGMPEPAT AGPVVSELAV GPVAEPAAVG PTAGRRAWGA ARVEVRAEGA MLVGEVTCAV PVAGGVRAAA DHARALGGWV AVRGVAGGAT TTRVTVPCG
|
| |