Gene Information |
Locus tag | Francci3_2784 |
Symbol | |
ID | 3904930 |
Type | CDS |
Is gene spliced | No |
Is pseudogene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3277160 |
End bp | 3278905 |
Gene Length | 1746 bp |
Protein Length | 581 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637880106 |
Product | aldehyde dehydrogenase |
Protein accession | YP_481872 |
Protein GI | 86741472 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
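The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the reported gene length includes the stop codon; both are assumptions about how this record is encoded, and the function names are hypothetical.

```python
# Values copied from the record above.
START_BP = 3277160
END_BP = 3278905


def gene_length(start: int, end: int) -> int:
    """Length in bp, assuming 1-based inclusive coordinates (an assumption)."""
    return end - start + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa, assuming the final codon is a stop codon (an assumption)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_len_bp // 3 - 1


if __name__ == "__main__":
    length_bp = gene_length(START_BP, END_BP)
    print("gene length (bp):", length_bp)
    print("protein length (aa):", protein_length(length_bp))
```

Under these assumptions the computed values agree with the Gene Length (1746 bp) and Protein Length (581 aa) fields.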
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGCCGA CCCCGCAGAA CCCGCAGAAC CCGCAGAACC CGCAGACCGT AGCGAACCGG GAGACCGGGG TGACTGAGGG GGTCGGTCCG GCGGTCTACC CGGCGGCCGG CATCGGGACG CTCCTGTCGA CGGATCCGGC CACCGGCGCG GTGGTCGCGA GCTTCCCGGT GATGGAGCCG ACCCAGGTGC GGGCGGCCGT GGCGTCCGCC CGGTCGGCGG CGGGCTGGTG GGCGGGGCTC GCGGCGGCAC GGCGGCGCGA CCATCTGCTG CGCTGGGCGG CGCACCTCGT CCGTCACGAG ACCGAGCTGA TCGACCTCCT GCACGCCGAG AACGGCAAGA CCGCCGCCGA TGCCCGCATC GAGCTGCTGC TGACGTTGGA ACACATCCGC TGGGCGGCCC GCAACGCCGC CCGCGTGCTG CGGACCCGCC GGGTGTCGCC CGGCCCGTTC CTGGCGAACC ACACCGGACG GATCGAGTAC CGCCCGTTCG GGGTCGTCGG GGTGATCGGG CCGTGGAACT ATCCGCTGTT CACCCCGAGC GGTTCGATCG CGTATGCCCT GGCAGCCGGT AACACGGTGG TCTTCAAGCC CAGCGAGTAC ACCCCGGCGG TGGGCGCGTA CCTGGTCGCG GCGTTCGCCG CGGCGAATCC GGACGCGCCG GCCGGGGTCC TGACCATGGT CACCGGCTTC GGTCCGACCG GTGCCGCCCT GTGTACAGCC GGGGTCGACA AGATTGCCTT TACCGGTTCG CCCGCGACCG GCCGCCTGGT CATGGCGGCC TGCGCGTCGT CCCTCGTCCC GGTGGTGATC GAATGCGGCG GCAAGGACCC GCTGATCGTC GCCGACGACG CTGACGTGGC CGCCGCCGCG CGGGCAGCGG CCTGGGGAGC GATGTCCAAT GGCGGCCAGA CCTGTGCCGG GGTTGAGCGG ATCTACGTGA CCGAGGCGGT CGCCGCGCCC TTCCTCGCCG CACTGCGCCG CGAGCTCGAC GGGGTTCGTC CCGGTGCCGA CCGGGACGCA TCCTACGGTC CGATGACGAT GCCCGGCCAG GCGGCGATCG TGCGCCGCCA CGTGGCTGAC GCGCTGGCCC GCGGCGCGAC CGCCCTGATC GGCGGGACGG AGTCGGTGGG GGACACCTTC ATCGAACCGG TCGTGCTGGT CGACGTGCCG GAGGGCAGCC CCGCCGTGCA GGAGGAGACG TTCGGGCCGG TCGCCACCGT CCGGACCGTC GCCGACGTTG ACGAGGCCGT CACGCTGGCC AACGGCACCC CGTACGCGCT CGGTGCGACG GTTTTCTCCC GGTCCCGGGG AGACGAGATC GCCAGCCGGC TCGACGCTGG GATGGTGTCC GTCAACGCGG TGCTGGCGTT TGCCGGGATG CCCGCCCTGC CCTTCGGGGG CAGCGGCGAA AGCGGGTTCG GCCGGGTGCA CGGCGCGGAG GGGCTGCGCG AGTTCGTCCG CCCCCGCTCG GTCGCCACCC TGCGGATGCA CGTGCCGGGA GCGACCCTGA CCACCTTCCG CCGGACGCCG GGTGCGCTCG CCGTCACCGC GGTGACGGCG CGGCTCCGGC ACGGCGGCGG CTGGGCCGGC TGGGCCAGCC GGGTGGCCGG GCGGGTCAGG CCGGTCGGCG GGGCGGCCGG GCCTCTCGAC AGGCCCCTCG GCGGGCACGT CGGGTCGGAC GAGGGATTCC GTCCCGGCCG GCTCCGGCTC CGAAAATCAT CCGGAAGTCG TCCAGACGGA CGGTGA
|
Protein sequence | MRPTPQNPQN PQNPQTVANR ETGVTEGVGP AVYPAAGIGT LLSTDPATGA VVASFPVMEP TQVRAAVASA RSAAGWWAGL AAARRRDHLL RWAAHLVRHE TELIDLLHAE NGKTAADARI ELLLTLEHIR WAARNAARVL RTRRVSPGPF LANHTGRIEY RPFGVVGVIG PWNYPLFTPS GSIAYALAAG NTVVFKPSEY TPAVGAYLVA AFAAANPDAP AGVLTMVTGF GPTGAALCTA GVDKIAFTGS PATGRLVMAA CASSLVPVVI ECGGKDPLIV ADDADVAAAA RAAAWGAMSN GGQTCAGVER IYVTEAVAAP FLAALRRELD GVRPGADRDA SYGPMTMPGQ AAIVRRHVAD ALARGATALI GGTESVGDTF IEPVVLVDVP EGSPAVQEET FGPVATVRTV ADVDEAVTLA NGTPYALGAT VFSRSRGDEI ASRLDAGMVS VNAVLAFAGM PALPFGGSGE SGFGRVHGAE GLREFVRPRS VATLRMHVPG ATLTTFRRTP GALAVTAVTA RLRHGGGWAG WASRVAGRVR PVGGAAGPLD RPLGGHVGSD EGFRPGRLRL RKSSGSRPDG R
|
| |
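The GC content field can in principle be recomputed from the gene sequence. The sketch below assumes the sequence is pasted exactly as displayed, in space-separated blocks of ten bases, and that GC content means the fraction of G and C among all bases; the helper names are hypothetical and not part of the database.

```python
def clean_sequence(raw: str) -> str:
    """Remove the whitespace used for display and normalize to upper case."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of bases that are G or C (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return gc / len(seq)


def has_start_and_stop(raw: str) -> bool:
    """Check for an ATG start and a TAA/TAG/TGA stop, a simplifying assumption
    (translation table 11 also allows alternative start codons)."""
    seq = clean_sequence(raw)
    return seq.startswith("ATG") and seq[-3:] in {"TAA", "TAG", "TGA"}


if __name__ == "__main__":
    # Paste the full "Gene sequence" value here; only the first and last
    # blocks are shown in this illustration.
    example = "ATGCGGCCGA CGGTGA"
    print(round(gc_content(example) * 100), "% GC")
    print("start/stop present:", has_start_and_stop(example))
```

Passing the full Gene sequence value to `gc_content` gives a figure that can be compared against the GC content field.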