Gene Information |
Locus tag | Franean1_3377 |
Symbol | |
ID | 5671748 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4001735 |
End bp | 4003183 |
Gene Length | 1449 bp |
Protein Length | 482 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641242265 |
Product | aldehyde dehydrogenase |
Protein accession | YP_001507685 |
Protein GI | 158315177 |
COG category | [C] Energy production and conversion |
COG ID | [COG1012] NAD-dependent aldehyde dehydrogenases |
TIGRFAM ID | |
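The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; neither convention is stated in the record. The function names are illustrative and do not come from any database tooling.

```python
# Values copied from the Gene Information fields above.
START_BP = 4001735
END_BP = 4003183
REPORTED_GENE_LENGTH = 1449      # bp
REPORTED_PROTEIN_LENGTH = 482    # aa


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len: int) -> int:
    """Residues encoded by an unspliced CDS, assuming the final codon is a stop."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len // 3 - 1


length = gene_length(START_BP, END_BP)
print("gene length (bp):", length)
print("protein length (aa):", protein_length(length))
print("matches record:",
      length == REPORTED_GENE_LENGTH
      and protein_length(length) == REPORTED_PROTEIN_LENGTH)
```

Under these assumptions the computed values agree with the reported 1449 bp and 482 aa.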
|
|
Plasmid Coverage information |
Num covering plasmid clones | 17 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGTTGGAGC GGGAGCAGAT GTTCGTCGGA GGCGCGTGGG TCGCGCCGAG CACCGACAGC AGGATCGAGG TGACCTCGCC GCACACCGAG AAGCTGATCG GTCGCGTCGC CGCGGCGAGC ACGGAGGACG TCGACCGCGC CGTCGCCGCA GCGCGGCGGG CCTTTGACGA GGGGCCGTGG CCGCGGACCG ATCCGGCCGA GCGGGTCGAG GTGATCCGGC GGCTGGCCAC GCTCTACCGC GGGCATCGCG ATGAGCTGGC CGAGCTCATC ACCGCCGAGC TGGGCGCGCC GATCTCGTTC GCCAGGCGCG CGCAGGTCGC CCTGCCCGGG GCGCTGATGA CCGCGCTGGC CGACACAGCG GCCGGCTACT CCTGGCAGGA GAAGCGGCCG GGAACCTACG GCCAGGACAT CATCCTGCGC AAGGAGCCCG TGGGCGTCGT CGCCGCGGTG GTCCCCTGGA ACATGCCGCA GTTCCTGACC GTCACCAAGG TCATCCCGGC GCTGCTCGCC GGCTGCACCG TCGTGCTCAA GCCGGCGCCC GAGTCGTCGC TCGACGCGCT GTTCTTCGCC GACCTGCTCG ACCAGGTCGG CCTGCCGCCG GGCGTCGTCA ACGTCCTCCC CGCCGACCGT GAGGTGAGCG CCTACCTAGT CGCCCACCCC GGTATCGACA AGGTCTCCTT CACCGGCTCG ACGGCGGCCG GCAGGCAGGT GGCGGCGGCA TGCGCTCCCA ACCTCACCAA GGTCAGCCTC GAGCTCGGGG GAAAGTCGGC GGCCATCGCT TTGGACGACG CCGACCCGGC CACCGTGGCG CGCGCCGTCC GCCTCTCGGG CATGGGCATG GCCGGGCAGA TCTGCAACTC CCTCACCCGG GTACTCGTGC CCACGAGCCG CGTCGGCGAC TATGCCGATG CGCTCGCGGC AACCCTGTCG GCCATCAGAA TCGGCGATCC GGCCGATCCC GCGACCGAGA TGGGCCCGCT CGTGGCCAGG CGCCAGCAGG AACGGGTCCG CCAGTACATC GACACCGGCG TGCGGGAGGG CGCCCGGCTC GTCGTCGGTG GCACCGATCT GCCCGACGGG ATCGATCGCG GTTGGTACGT GCGACCGACC GTCTTCAGCG ACGTGGACAA CGCGATGACG ATCGCCCGGG AGGAGATCTT CGGCCCGGTC CTTGTCGTCA TCCCCTACGG CGACGAGGAC GAGGCCGTTC GCATCGCCAA CGACTCGGAG TACGGCCTGG CCGGCTCCGT GTTCACCGCG GACACCGGGC ACGGGCTCGA GGTGGCGGGC CGGGTCCGGG CCGGGACTTT CGGCGTGAAC CAGGGCTACT CCATGGATCC CGCGGCCCCC TTCGGGGGAG TGAAGGCCAG CGGCTACGGC CGCGAGCTCG GCCGTGAGGG CCTCGAGGGC TACCTCGACG TCAAATCGAT CTCCGTCGCC ACCGCGTAG
|
Protein sequence | MLEREQMFVG GAWVAPSTDS RIEVTSPHTE KLIGRVAAAS TEDVDRAVAA ARRAFDEGPW PRTDPAERVE VIRRLATLYR GHRDELAELI TAELGAPISF ARRAQVALPG ALMTALADTA AGYSWQEKRP GTYGQDIILR KEPVGVVAAV VPWNMPQFLT VTKVIPALLA GCTVVLKPAP ESSLDALFFA DLLDQVGLPP GVVNVLPADR EVSAYLVAHP GIDKVSFTGS TAAGRQVAAA CAPNLTKVSL ELGGKSAAIA LDDADPATVA RAVRLSGMGM AGQICNSLTR VLVPTSRVGD YADALAATLS AIRIGDPADP ATEMGPLVAR RQQERVRQYI DTGVREGARL VVGGTDLPDG IDRGWYVRPT VFSDVDNAMT IAREEIFGPV LVVIPYGDED EAVRIANDSE YGLAGSVFTA DTGHGLEVAG RVRAGTFGVN QGYSMDPAAP FGGVKASGYG RELGREGLEG YLDVKSISVA TA
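The sequences above are printed in space-separated blocks of ten characters. As a minimal sketch, the helpers below strip that spacing, compute a GC fraction, and translate a coding sequence. The names `clean`, `gc_fraction` and `translate` are hypothetical. The codon table is built from the standard amino-acid assignments, which translation table 11 shares; table 11 differs in the start codons it permits, and this sketch does not model that (the gene here starts with ATG, so it does not matter for this record). Only the first 30 bases and the final 9 bases of the gene are copied in as a check.

```python
# Standard codon assignments in TCAG order; table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence (0.0 for an empty sequence)."""
    seq = clean(seq)
    if not seq:
        return 0.0
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon from position 0, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks and last nine bases of the gene sequence in the record.
GENE_START = "ATGTTGGAGC GGGAGCAGAT GTTCGTCGGA"
GENE_END = "ACCGCGTAG"
# First block of the protein sequence in the record.
PROTEIN_START = "MLEREQMFVG"

print(translate(GENE_START) == PROTEIN_START)
print(translate(GENE_END))
```

Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the reported GC content and the complete protein sequence.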