Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_4702 |
Symbol | |
ID | 5673044 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 5617369 |
End bp | 5618493 |
Gene Length | 1125 bp |
Protein Length | 374 aa |
Translation table | 11 |
GC content | 67% |
IMG OID | 641243559 |
Product | amidohydrolase 2 |
Protein accession | YP_001508975 |
Protein GI | 158316467 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.00593918 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 15 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGATTACA AGGTCATCTC GGCGGACAAC CACATCATCG AGGCGCCGCA CACGTTCACC ACCTATCTGC CGAAGGAGTA CCGGGACCGG GCGCCGCGCA TCCTGCGCGG CGCGGACGGC GGCGACGGCT GGAGCTTCGA CGGCAAGCCG CCGAGCAAGA CGTTCGGGCT GAACGCCGTA GCCGGCCGCC CGTTCGAGGA CTACAAGGCC AGCGGCCTGA CCGTCGATGA GATCCTCCCG GGCAACTACG ACGGCGCCGC CCACCTGAAG GACATGGACG CCGACGGCGT GGACGCCGCC ACCATCTACC CGATGGCCTC TCTCACGTCG TACACGCTCG ACGATCGACC CTTCGCCCTC GCCATCCTGC GGGCCTACAA CGACTGGCTG CTCGACGAGT TCTGCGCCGT CAACCCGCAG CGGCTCATCG GTCTGCCGCT CCTGCCGGTC GACGACGGCA TGGACGTCCT GCTCGCCGAA CTCGAGCGGG TGGCTGCCAA GGGTGCCAAG GGCGCCTTCC TCCCCTACTG GAGCGAGCGC CCGTACTACG ACAGCTACTA CGAGCCGCTC TGGACGGCGG CCGAGCAGGC ACCACTGACG CTGTGCATCC ACCGAACCAT GGGCGGGAAG GAACCGGCGG GGCAGGCCAC CCCAAGGCCG GAGGCCGCCG CGGGTGTCAA CCTCGCCGGT ATCGTCCAGC GGTTCTTCAC CGGCGTCGCG CCGTTCTCCC AGCTGACCTT CACCGGTGTG TTCGAACGGC ACCCCGGCCT GAAGTTCGTC GACGCCGAGG TCAACTTCGG GTGGCTGCGG TTCTGGGCCC TGATGATGGA CCAGGAGTTC GAGCGCCAGA AGCACTGGGC CAACCCGCCG CTGCACACCC CGCCCCACGA GTTCATCGGC AAGAACCTTT TCGTCAGCGT GCTCGACGAC TTCGTCGGCT TCGAAGACGC CAAGCGCGAC CCGCTCGTGG CGTCGGCCGC CATGTTCTCC ATCGACTACC CGCACAGCGG GACGCTGTTC CCGAAGACCC AGCAGTACAT CGCCGAGCTG ACCCCAGGCC TCGACGACGA CCGCAAGCAC GCCATCCTCG CGGGGAACGC TGTGCGGGTG TTCAACCTCG CATGA
|
Protein sequence | MDYKVISADN HIIEAPHTFT TYLPKEYRDR APRILRGADG GDGWSFDGKP PSKTFGLNAV AGRPFEDYKA SGLTVDEILP GNYDGAAHLK DMDADGVDAA TIYPMASLTS YTLDDRPFAL AILRAYNDWL LDEFCAVNPQ RLIGLPLLPV DDGMDVLLAE LERVAAKGAK GAFLPYWSER PYYDSYYEPL WTAAEQAPLT LCIHRTMGGK EPAGQATPRP EAAAGVNLAG IVQRFFTGVA PFSQLTFTGV FERHPGLKFV DAEVNFGWLR FWALMMDQEF ERQKHWANPP LHTPPHEFIG KNLFVSVLDD FVGFEDAKRD PLVASAAMFS IDYPHSGTLF PKTQQYIAEL TPGLDDDRKH AILAGNAVRV FNLA
|
| |