Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_0638 |
Symbol | |
ID | 5669055 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 742461 |
End bp | 743669 |
Gene Length | 1209 bp |
Protein Length | 402 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 641239565 |
Product | amidohydrolase 2 |
Protein accession | YP_001505003 |
Protein GI | 158312495 |
COG category | [R] General function prediction only |
COG ID | [COG2159] Predicted metal-dependent hydrolase of the TIM-barrel fold |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.477012 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.622567 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCCGAGGC CGAACTGGCA GCTGCTTCCG GACCCAGAGC CGCAGGAACG CAGCTACTCG ATCGTCTCGG TGGACGATCA TCTGACCGAG CCTGCCGACA TCTTCGTCAA GCGGTTCCCG GCGAACCTTC GTGACAAGGC ACCTCAGGTG ATCACGACAC CAGACGGTTC GGAGGCCTGG GTCTACCGCG ACCGGCTCTA CCGTGACAAC GGAATGAGTG TCGTCGCCGG GCGCCCGCAG TCCGAGTGGA ACCTCGACCC GCTGAACTTC AGCGAGATGC GCCGGTCCGC GTGGGACGTC CATGCCCGCG TGAAGGACAT GGACCTCGAC GGCATCTGGG CGTCACTGTG CTTCCCCTCC GGGGCGTGGG GCTTCACCGG CCGCGTGCTG TCGATGAACA ACGACCAGGA GGTCGGGCTC GCCGCGGTCC GTGCCTGGAA CAGCTGGATG ATCGAGGAGT GGCACGGGGC GTACCCGGAG CGCTTCATCC CGATGCAGCT GCCCTGGTTC AAGGACCCCG AGGTCGCCGC CGAGGAGATC CGCCGCAACG CCGAGCTCGG CTTCACCTCG GTGTCGTTCC TCGAGTCGCC GCACCTGCTC AAGCTTCCGC CGATCACCAA CCACAAGCAC TGGGAGCCGT TCTTCAAGGC GTGCGAGGAG ACCGACACGG TCATTTCGCT GCACTGCGGC GCGAGCGGCT TCGTCCTGCA GGGCTCGCCG GGCGGCGGCC TGAACGTGCA GACGTCGCTC TTCCCGGCGG GCGCGTTCTG CGCGGCCGTG GACTGGGTGT GGGCGGGCAT CCCGGCGCTC TACCCGAACC TCAGGATCGC GCTGAGCGAG GGTGGCATCG GCTGGGTGCC GATGGCGATC AACCGCCTCG ACTACGTGCT CGAGCACTCC GGCAGCGGCG GCACGCCGTG GACGTACGAC GTGACGCCGA GCGAGGCGCT GCGCCGGAAC TTCTACTTCT GCATGCTGGA CGACCCCGGC ACGCTCGACC AACGTCACAT GATCGGGATC GACCACATCC TGTTCGAGAC GGACTTCCCG CACGCCGACT CGACCTGGCC GGGCTCGCAG GACCTGCTGC GCAAGCGTTT CGCCGACATC CCTCGGCACG AGGCCGTGAT GATCGCCGGT GGCAACGCCG CGAGGCTGTT CCGGCACCCG CTCCCGCAGG GCGGCGACTG GCCGGCGATC ACCCCGTAG
|
Protein sequence | MPRPNWQLLP DPEPQERSYS IVSVDDHLTE PADIFVKRFP ANLRDKAPQV ITTPDGSEAW VYRDRLYRDN GMSVVAGRPQ SEWNLDPLNF SEMRRSAWDV HARVKDMDLD GIWASLCFPS GAWGFTGRVL SMNNDQEVGL AAVRAWNSWM IEEWHGAYPE RFIPMQLPWF KDPEVAAEEI RRNAELGFTS VSFLESPHLL KLPPITNHKH WEPFFKACEE TDTVISLHCG ASGFVLQGSP GGGLNVQTSL FPAGAFCAAV DWVWAGIPAL YPNLRIALSE GGIGWVPMAI NRLDYVLEHS GSGGTPWTYD VTPSEALRRN FYFCMLDDPG TLDQRHMIGI DHILFETDFP HADSTWPGSQ DLLRKRFADI PRHEAVMIAG GNAARLFRHP LPQGGDWPAI TP
|
| |