Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Francci3_2944 |
Symbol | |
ID | 3903759 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 3477679 |
End bp | 3479199 |
Gene Length | 1521 bp |
Protein Length | 506 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 637880265 |
Product | aldehyde dehydrogenase |
Protein accession | YP_482031 |
Protein GI | 86741631 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 28 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.225427 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACGTACG TCCCACCTGG AAAGCCAGGC AGTATCGTCC ATGTTACGGA CCGGTACGAG AACTTCATCG GTGGAAAGTG GCTGCCGCCG CGGGCCGGTA AGTATTCGAC CAACCTCAGC CCGGCCACGG CGCAGCCGAT CTGTGAAATA CCGCGGTCGG CGGCGGAGGA TGTCGAGGAC GCCCTGGACG CGGCGCACGC CGCCGCCGGC TCCTGGGCCG CCGCGTCCCC GGCGGAGCGG GCCGAGGTGC TGACCGCGGT GGCGGATGCG ATCGACGCCA ACCGGGAGAT GCTCGCGGTC ACGGAGAGCT GGGAGAACGG CAAGCCGGTG CGGGAGACGC TGGCTGCCGA CATCCCGCTG GCCGCCGACC ACTTCCGGTA CTTCGCCGCG GCGGCGCGCA CCCTGGAGGG GTCGATCTCC GAGATTGACG GCAGGACCTA CGCCTATCAC TTCCACGAGC CGCTCGGGGT GGTCGGGCAG ATCATCCCCT TCAACTTCCC GTTGCTGATG GCGGCGTGGA AGCTGGCGCC CGCGCTGGCC GCGGGTAACT GCTCGGTGAT CAAGCCGGCG TCACCGACGC CGTGGTCGAT CCTCAAGCTC GCCGAGGTGA TTCAGGACGT CATCCCGCCC GGCATCATCA ACATCGTCAA CGGGCCCGGC GCCGAGGTCG GCAGAGCCCT GGCCACCAGC CCGAAGATCG CGAAGATCGG GTTCACCGGT GAGACCACGA CCGGCCGGCT GATGATGCAG TACGCGGCGC GGAACATCAT CCCGGTCACC CTGGAGCTGG GCGGAAAGTC CCCGAACATC TTCTTCGAGG ACGTCCTGGC CGCCGACGAC GCCTATCTGA ACAAGGCGGT CGAGGGGCTG GTGCTGTATG CGTTCAACAA GGGCGAGGTG TGCACCTGCC CGTCCCGGGC GCTGATCCAG GAGTCGATCT ACGACGAGTT CATGGCGCGG GCGCTGGAAC GGGTCGGCCG GATCCAGCAG GGCAACCCGT TGGACCCGGC GACAATGCTG GGCCCGCAGG TCTCCGCGCA GCAGCTGTCA AAGATCGGCT CCTACGTGGA CATCGGCCTG GCTGAGGGCG CCGAACTGCT GGCCGGTGGG CACCGGACGC GGCTGGCCGG AGAGTTCGCG AACGGCTACT TCTTCGAGCC GACGCTGCTG AAGGGCCACA ACAAGATGCG GGTCTTCCAG GAGGAAATCT TCGGCCCGGT GCTGGCTGTC ACCACCTTCA AGGACGAGGC CGAGGCGTTG GCCATCGCCA ACGACACGCC TTACGGGCTC GGTGCCGGTG TGTGGACGCG GGACGGTGCC CGCCAGTTCC GGATGGGCCG GGGCATCAAG GCCGGCCGGG TCTGGGTCAA CTGCTACCAC CAGTATCCGG CGGGCGCGGC ATTCGGCGGA TACAAGGTGT CCGGCATCGG CCGCGAAAAC CACCGGATGA TGCTGGAGCA CTACAGCCAG ACCAAGAACA TGCTGGTCAG CTACGACGAG AACGCGCTCG GCCTGTTCTA G
|
Protein sequence | MTYVPPGKPG SIVHVTDRYE NFIGGKWLPP RAGKYSTNLS PATAQPICEI PRSAAEDVED ALDAAHAAAG SWAAASPAER AEVLTAVADA IDANREMLAV TESWENGKPV RETLAADIPL AADHFRYFAA AARTLEGSIS EIDGRTYAYH FHEPLGVVGQ IIPFNFPLLM AAWKLAPALA AGNCSVIKPA SPTPWSILKL AEVIQDVIPP GIINIVNGPG AEVGRALATS PKIAKIGFTG ETTTGRLMMQ YAARNIIPVT LELGGKSPNI FFEDVLAADD AYLNKAVEGL VLYAFNKGEV CTCPSRALIQ ESIYDEFMAR ALERVGRIQQ GNPLDPATML GPQVSAQQLS KIGSYVDIGL AEGAELLAGG HRTRLAGEFA NGYFFEPTLL KGHNKMRVFQ EEIFGPVLAV TTFKDEAEAL AIANDTPYGL GAGVWTRDGA RQFRMGRGIK AGRVWVNCYH QYPAGAAFGG YKVSGIGREN HRMMLEHYSQ TKNMLVSYDE NALGLF
|
| |