Gene Information |
Locus tag | Franean1_2978 |
Symbol | |
ID | 5671362 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 3503280 |
End bp | 3504569 |
Gene Length | 1290 bp |
Protein Length | 429 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641241882 |
Product | amidohydrolase |
Protein accession | YP_001507302 |
Protein GI | 158314794 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
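
The coordinate and length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the final codon is a stop codon excluded from the protein length. The function names `gene_length` and `protein_length` are hypothetical helpers introduced for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Length in aa of the encoded protein for an unspliced CDS.

    Assumes the CDS is a whole number of codons and that the last
    codon is a stop codon, which is not counted in the protein.
    """
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record for Franean1_2978.
length_bp = gene_length(3503280, 3504569)
print(length_bp)                  # 1290
print(protein_length(length_bp))  # 429
```

Under these assumptions the computed values agree with the Gene Length (1290 bp) and Protein Length (429 aa) fields.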
Plasmid Coverage information |
Num covering plasmid clones | 29 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 20 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGACGGCGC CGATCGTCCT GCGCGCCGCA CGCTGGCTGG ACATCGACGC CGGAGAGGTC CGCGCGCCCG CGGAGATCGT CGTAGAGGGC GACCGCATCA CCGCGGTGAA CCCCGCCACG CCGCCTGCGG GCAGCGTGGA ACTCGACCTG GGCGACGTCA CGTTGCTCCC CGGGTTGATG GACATGGAGC TCAACTTCCT CATCGGCGGG CCCGAGACCC CGACGGGACT GCCGCTGCCC ATGCACGGCG TGCAGGACGA CCCGGCGTAC CGCACCATCC GAGGAACCAT CAACGCCCGT ACGACGCTGC ACGCTGGCTT CACCACGGTG CGCAACCTCG GTCTGATGGT CAAGACCGGG GGGTACCTGC TCGACGTAGC ACTCCAGCGC GCCATCGACC AGGGATGGGC CGAGGGCCCG CGGATCATCG CGGCCGGGCA CGCGGTGACC CCGTACGGCG GGCACCTCGA CCCGACCGTG TTCCAGCGGC TGGCGCCCGG CGTCATGCCC CTGTCGATCG GCGAGGGGAT CGCCAACGGT GTGGGCCAGG TGCGGGAATG CGTCCGCTAC CAGATCCGCC ACGGTGCCAG GGTCATCAAG GTGTCGGCCT CCGGCGGGGT GATGTCGCAC AGCACCGGGC CGGGCGCCCA GCAGTACTCC GACGAGGAGC TGGCCGCGAT CGCGGACGAG GCGCACCGGG CCGACATCCG GGTCGCCGCG CACGCGGTGG GCGACCGGGC GATCCGGGCC TGCGTGCGTG CCGGGATCGA CTGCATCGAG CACGGCTTCC TCGCCAGCGA CGACACCCTC AAGCTGATGG CCGACCACGG CACGTTCCTG GTGTCGACGA CCTACCTGAC CGACGCGATG GACATCGCCC GGGCCGCACC CGAGCTGCGC AAGAAGGCGG CAGTGGTGTT CCCCCAGGCC AGGGCGATGC TCCCGAAGGC GATCGCGGCC GGGGTGCGGA TCGCCTGCGG CACCGACGCG CCCGCCGTGC CACACGGTCA CAACGCCAAG GAACTGATCG CACTGGTGTC GCGGGGCATG ACTCCCGTCC AGGCCCTGCG GGCCGCGACC GTCACCAGCG CGGAGCTCGT CGAACTGGAC CACGAGCTCG GACGGTTGAA GGCCGGCTAC CTCGCCGACA TCATCGCCGT CCCCGGCGAC CCCTCCCAGG ACATCACCCG CACCGAGGAC GTCCGCTTCG TCATGAAGGA CGGCCTCGTC CACCGCGACG ACCGATCGTC ACCGACCCGA ACGGAGCACA CATGGCAAGC GGCGTCCTGA
Protein sequence | MTAPIVLRAA RWLDIDAGEV RAPAEIVVEG DRITAVNPAT PPAGSVELDL GDVTLLPGLM DMELNFLIGG PETPTGLPLP MHGVQDDPAY RTIRGTINAR TTLHAGFTTV RNLGLMVKTG GYLLDVALQR AIDQGWAEGP RIIAAGHAVT PYGGHLDPTV FQRLAPGVMP LSIGEGIANG VGQVRECVRY QIRHGARVIK VSASGGVMSH STGPGAQQYS DEELAAIADE AHRADIRVAA HAVGDRAIRA CVRAGIDCIE HGFLASDDTL KLMADHGTFL VSTTYLTDAM DIARAAPELR KKAAVVFPQA RAMLPKAIAA GVRIACGTDA PAVPHGHNAK ELIALVSRGM TPVQALRAAT VTSAELVELD HELGRLKAGY LADIIAVPGD PSQDITRTED VRFVMKDGLV HRDDRSSPTR TEHTWQAAS
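
The sequences above are printed in space-separated blocks of ten characters. The following is a rough sketch of how such a string might be cleaned and its GC fraction computed, for comparison with the GC content field. It is an illustration only, not the database's own method, and `clean_sequence` and `gc_content` are hypothetical helper names.

```python
def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_content(raw: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence (0.0 to 1.0)."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First two blocks of the gene sequence, used here as a short example.
example = "GTGACGGCGC CGATCGTCCT"
print(clean_sequence(example))   # GTGACGGCGCCGATCGTCCT
print(gc_content("GTGACGGCGC"))  # 0.8
```

Passing the full Gene sequence string to `gc_content` gives a value that can be compared with the 71% reported in the record.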