Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4802 |
Symbol | |
ID | 5673143 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 5733475 |
End bp | 5734704 |
Gene Length | 1230 bp |
Protein Length | 409 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641243658 |
Product | hypothetical protein |
Protein accession | YP_001509074 |
Protein GI | 158316566 |
COG category | |
COG ID | |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.644094 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGCAGGC ACAGCACTCG CGTCGCGGTC GTCGGGGCCG CGGTGGCCGT CACCCTCGCC GGGGCGCTCG TCGGCACGGC GGCGGCGGAC GGGACCACGG TGGCCGACGT CACCTGGACG GCCGCCAACT CCGAGGCCAC CGGCGACCAG GACAACGCGG AGGTGTCCGC GACCCGCAAC GGCTACACCG CCGTGGTCTG GGAGGACGAC CGGGATACCA CGGCCCCCGA GGACACCCTT CACACCGAGG TGTACCTCCG CCTGTACCGC GACGGGACGT CGCTGTACGA GAAGAAGCTG TCGGCGGGCG GGAGCGGCAG CTGGCGGCAC GTCCAGCCCG ACGTCGCCCT GCGCGAGGAC GGCACCGCGG TGGTCATCTG GGCGGAGGAC CCTGACGGCA ACGGGTACTA CAACATCGCC GTGCGTGCGG TGAACACCGC GGGCACGGTG ACCGGCTCGG CCCAGGCCAA CGCGAACGCC GACGGGCAGC AGCTCAACGC GCACGTCGCG GCGGACCCGG ACGGCCCCGG GTTCGCGGTC GCGTTCGAGG ACGTCCAGGG CACCGCGGCA CCGACCGTGC GGGTGTCCGG GTTCGTGTCG GTGTCGTCCA AGACCTACGA GGTCCAGGTG CACGCGACGG GCGGCACCCA CCGCCGGCCC GACGTGGCGA CGGACGCCGC GGGCAACGCC GTCGTCGTCT GGGACGAGGA CGGTGACGGC AACGGGTCGT TCAACATCGG CCGGAAGATC TTCACCTCCT CGGGCGGTGT GAAGGCGGCG CAGTCCGTCG CCAACGTGAC GACCGCGGGG AACCAGCTCC ACCCGTCGGT GGCCGCGAAC CTCAACGGCG ACCAGGTCGT CGCGTGGGAG ACCGACCAGA ACGGGAGCGC GCAGGTCGGC GCCCGTTCCT TCAGCGCGGC GAACGCGGCC GGACCTGAGG TCGTCCTGCC CGGGGCGGAC CCGCAGAGCG GCATCGACGA CCAGCGCAAC GCGGTGGTGT CCTGGGGCGA GTCCACCGAC GTCCACGCCC AGGGTCTCAA CCCGGACGGC ACGGTCACCG GCCGACTGCC CCAGCTGCGG GTCCACACCA CCGTCGCCGG CAAGCAGAAC GAGCCGGCGC TCGGCGTCAA CCCCTGGGGC CAGATCGTGA TCGCCTACAC CGACGACAAC GACGGCAACG GCTTCGACCA GGTGTATCTG GGCACCGGCC TGGTCAACAG CACCTGGTGA
|
Protein sequence | MRRHSTRVAV VGAAVAVTLA GALVGTAAAD GTTVADVTWT AANSEATGDQ DNAEVSATRN GYTAVVWEDD RDTTAPEDTL HTEVYLRLYR DGTSLYEKKL SAGGSGSWRH VQPDVALRED GTAVVIWAED PDGNGYYNIA VRAVNTAGTV TGSAQANANA DGQQLNAHVA ADPDGPGFAV AFEDVQGTAA PTVRVSGFVS VSSKTYEVQV HATGGTHRRP DVATDAAGNA VVVWDEDGDG NGSFNIGRKI FTSSGGVKAA QSVANVTTAG NQLHPSVAAN LNGDQVVAWE TDQNGSAQVG ARSFSAANAA GPEVVLPGAD PQSGIDDQRN AVVSWGESTD VHAQGLNPDG TVTGRLPQLR VHTTVAGKQN EPALGVNPWG QIVIAYTDDN DGNGFDQVYL GTGLVNSTW
|
| |