Gene Information |
Locus tag | Franean1_3495 |
Symbol | |
ID | 5671866 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4158467 |
End bp | 4159975 |
Gene Length | 1509 bp |
Protein Length | 502 aa |
Translation table | 11 |
GC content | 72% |
IMG OID | 641242383 |
Product | 2-methylcitrate dehydratase |
Protein accession | YP_001507803 |
Protein GI | 158315295 |
COG category | [R] General function prediction only |
COG ID | [COG2079] Uncharacterized protein involved in propionate catabolism |
TIGRFAM ID | |
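The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon while the protein length does not; the function names are hypothetical and are not part of any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Franean1_3495.
length_bp = gene_length(4158467, 4159975)
length_aa = protein_length(length_bp)
print(length_bp)  # 1509
print(length_aa)  # 502
```

Both results agree with the Gene Length (1509 bp) and Protein Length (502 aa) fields listed in the record.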
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.375848 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 13 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGATCGACC ACAAGGTCCG CGTGTATCGG AGCGCCGAGC GGCTGGCCCG CGAGGACCAG CTCGCCTGGA AGATCGCCGC GGTCGCCACC GACCCGGTCG AGGTCACCGC CGACGTGACC GACATGGTGA TCAACCGGGT CATCGACAAC GCCGGGGTGG CCGCCGCGTC GCTGACCCGA AAGCCGGTGG CCGCCGCCCG CGACCAGGCG CTGCGGCACA TCCCCGCCCC CGGGCGTGAC GGCGCTACCG TGTTCGGCAC GCCGGCCGGC GTCCGCGTCT CCCCGGAGTG GGCAGCCTGG GCGAACGGCG TCGCCGTGCG CGAGCTGGAC TTCCACGACA CCTACCTGGC GGCGGACTAC TCCCACCCCG GGGACAACAT CCCGCCGGTG CTGGCCGTCG CCCAGCATCT GGGGCTGGAC GGTGCCGCCC TGCTGCGCGG GATCGCCACC GGCTACGAGA TCCAGGTCGC GCTCGTCCGT GGGATCTGCC TGCACCGGCA CAAGATCGAC CACATCGCGC ACCTCGGCCC GTCCGCCGCC GCCGGCATCG GCGCCCTGCT CGGCCTGCCG ACCGAGACCG TCCACCAGGC TGTCGGACAG GCCCTGCACA CGACCACAAC AACCCGGCAG TCACGCAAGG GCGAGATCTC GAGCTGGAAG GCGTACGCAC CGGCGTTCGC CGGCAAGCTC GCCGTCGAGG CGGTGGACCG GGCCATGCGC GGTGAGGGCG CGCCGTCACC GGCCTACGAG GGTGAGGACG GGTTCATCGC CTGGCTGCTC GACGGGCCCG GCGGGCAGTA CGTCGTCTCG CTGCCCGCGG CCGGGGAGGC CAAGCGGGGG ATCCTCGAGA CCTACACCAA GGAGCACTCC GCCGAGTACC AGAGCCAGGC GCTGATCGAC CTGGCCCGCC GCCTCGGCCC GCGGATCGGG GACTTTGCCC GAGTCCGCTC CATTGCGATC CACACCAGCC ATCACACCCA CTACGTGATC GGCTCCGGGG CGAACGACCC GCAGAAGTAC GACCCGAAGG CCAGCAGGGA GACCCTCGAC CACTCGATTC CCTACATCTT CGCGGTCGCG TTGCAGGACG GCGACTGGCA CCACGAGCGC TCCTACGCCC CCGAGCGGGC GACGCGGCCG GACACCGTCG CCCTCTGGCG GAAGATCACC ACGCTGGAGG ACAAGGAGTG GACGCGGCGT TACCACGCCA CCGACCCCGC CGAGAAGGCG TTCGGTGGCC GGGTCGTCGT CGAGCTCGAC GACGACACCG TGCTCACCGA CGAGATCGCC GTCGCCGACG CGCATCCGCT CGGCGCCCGT CCGTTCGGCC GGGACGAGTA CGTCGGCAAG TTCCGCCGCC TCGCCGAGGG CGTCATCCCC GGCCCCGAGC AGGACAGGTT CCTCGACACC GCCGCCCGCC TGCCCGAACT GACCCCGGAC GAACTCGCCG GCCTCACCCT CACTCCCGAG CCCGCACTCA CCACCGGGAG CACCCAGGGG ATCTTCTGA
|
Protein sequence | MIDHKVRVYR SAERLAREDQ LAWKIAAVAT DPVEVTADVT DMVINRVIDN AGVAAASLTR KPVAAARDQA LRHIPAPGRD GATVFGTPAG VRVSPEWAAW ANGVAVRELD FHDTYLAADY SHPGDNIPPV LAVAQHLGLD GAALLRGIAT GYEIQVALVR GICLHRHKID HIAHLGPSAA AGIGALLGLP TETVHQAVGQ ALHTTTTTRQ SRKGEISSWK AYAPAFAGKL AVEAVDRAMR GEGAPSPAYE GEDGFIAWLL DGPGGQYVVS LPAAGEAKRG ILETYTKEHS AEYQSQALID LARRLGPRIG DFARVRSIAI HTSHHTHYVI GSGANDPQKY DPKASRETLD HSIPYIFAVA LQDGDWHHER SYAPERATRP DTVALWRKIT TLEDKEWTRR YHATDPAEKA FGGRVVVELD DDTVLTDEIA VADAHPLGAR PFGRDEYVGK FRRLAEGVIP GPEQDRFLDT AARLPELTPD ELAGLTLTPE PALTTGSTQG IF
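The relationship between the gene sequence and the protein sequence can be illustrated with a minimal sketch. It strips the spaces that separate the 10-character blocks, translates codons using the amino acid assignments of translation table 11 (which match the standard code for elongation codons), and computes a GC fraction. The helper names are hypothetical, and only the first 30 bases of the gene sequence are used here, so the GC value shown is for that fragment and not the 72% reported for the whole gene.

```python
# Codon table built in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = [a + b + c for a in BASES for b in BASES for c in BASES]
CODON_TABLE = dict(zip(CODONS, AMINO_ACIDS))


def clean(seq: str) -> str:
    """Remove the block-separating whitespace and normalise to upper case."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the sequence."""
    s = clean(seq)
    return (s.count("G") + s.count("C")) / len(s)


def translate(seq: str) -> str:
    """Translate in frame 1, stopping at the first stop codon."""
    s = clean(seq)
    residues = []
    for i in range(0, len(s) - 2, 3):
        aa = CODON_TABLE[s[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence from the record.
fragment = "ATGATCGACC ACAAGGTCCG CGTGTATCGG"
print(translate(fragment))              # MIDHKVRVYR
print(round(gc_fraction(fragment), 3))  # 0.567
```

The translated fragment matches the first block of the protein sequence. This sketch does not handle alternative start codons, which table 11 permits; the gene here begins with ATG, so that case does not arise.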