Gene Information |
Locus tag | Franean1_4173 |
Symbol | |
ID | 5672528 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4960385 |
End bp | 4962076 |
Gene Length | 1692 bp |
Protein Length | 563 aa |
Translation table | 11 |
GC content | 77% |
IMG OID | 641243046 |
Product | amidohydrolase 3 |
Protein accession | YP_001508463 |
Protein GI | 158315955 |
COG category | [R] General function prediction only |
COG ID | [COG1574] Predicted metal-dependent hydrolase with the TIM-barrel fold |
TIGRFAM ID | |
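
The length fields above follow from the coordinates. The sketch below is a minimal check and not part of the database's own tooling. It assumes the start and end positions are 1-based and inclusive, which is consistent with the listed gene length, and that the final codon of this unspliced CDS is a stop codon. The variable names are illustrative.

```python
# Values copied from the record above.
start_bp = 4960385
end_bp = 4962076

# Assumption: coordinates are 1-based and inclusive, so both endpoints count.
gene_length = end_bp - start_bp + 1

# Assumption: the CDS is unspliced and ends with one stop codon,
# which is not counted as an amino acid.
protein_length = gene_length // 3 - 1

print(gene_length, protein_length)  # 1692 563
```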
| |
Plasmid Coverage information |
Num covering plasmid clones | 2 |
Plasmid unclonability p-value | 0.000539132 |
Plasmid hitchhiking | Yes |
Plasmid clonability | hitchhiker |
| |
Fosmid Coverage information |
Num covering fosmid clones | 11 |
Fosmid unclonability p-value | 0.473952 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGCACACTT GCCCACCGCG GACCACAGGT GAGCGCCGCC CACCCGGCAC AGTAGGGACC GTGAGCCCCC ACCACCTGCC GGCAGCCCGG CGCGTCCTCT ACCACCACGG ACACGTCCAC AGCCCCACCC ACCGGCACGC CACCGCCCTG CTCACCGACG GCCCCACCAT CACCTGGATC GGCACCGACG ACCAGCCCGA CGCCAGCGGG CCGCTGCCAG CCGGCCCGGT CGACCACACC GTCGACCTGC GCGGCGCCAC CCTCACCGCC GCCTTCGTCG ACGCCCACCT GCACACCACC GCCACCGGGC TCGCCCTCGA CGGCGTCGAC CTCACCGACA GCCCGTCCCT GCGCCACACC CTCGACCAAC TCACCCGCGC CGCCCACCAC CGGCCCGGCC AGCCACTGCT CGGCACCGGC TGGGACGAAA CCCGCTGGCC CGAACAGCGA CCCCCCACCA GCGGGGAACT CGCCCGCGCC GCCGGGCCCG TCGACGTCTA CCTCGCCCGC GCCGACGGCC ACACCGCCGT CATCTCCCCG CACCTGGCCA CCCGCAGCGG CGCCCACCAC GCCACCGGCT GGCTCGGCGA CGGCCTGTGC CGCGACGACG CCCACCACCT CGCCCGCACC GCCGCCTACC ACGACCTGCC CCCCACCACC CGCCGCGCCG CCGCCCGCCG CGTCCGCGCC CACGCCGCCA CCCTCGGCAT CGCCGCCCTG CACGAGATGG CCGGCCCGCA GGTCTCCTCC GCCGACGACC TCGCCGCCCT GCTCACCCTC GCCCGCGACG AACCCGGCCC CACCATCACC GGCTACTGGG CCGGTGAACT CACCGTCGCC ACCGCCCTGA ACACCGAACC CGGCCTGGGC CCCGTCGGCT ACGGCGGCGA CCTGTTCGTC GACGGCTCCC TGGGCTCACA CACCGCCGCG CTACGCAGCC CCTACACCGA CCAGCCCACC CACCGCGGGC AGCTGCACCG CGACGCCGAC GACGTCCGCG ACACCGTCCT CGACGCGGTC GCCGCCGGCC TGCAGACCGG CTTCCACGCC ATCGGCGACG CCGCCCTCGA CACCGTGCTC GACGGGGTGC GCGCCGCCAC CGCCCGCGTC GGCACCGCCA CGATCAGCGC CGGCACCCAC CGGGTCGAAC ACGCCGAGCT GCTGCACCCC GAACAGATCA TCGCCATGGC GCGCCTGGGG CTCGTCGCCT CCGTGCAGCC CGCGTTCGAC GCCCGCTACG GCGGCCCCGA CGGCCTGTAC ACCCGCCGGC TCGGCGCCGA CCGTGCCAGC GCGATGAACC CGTTCGCCGC GCTGCACCGC GCCGGGGTCG TACTCGCCCT GTCCTCCGAC AGTCCCGTCA CCCCCCTCGA CCCGTGGGGA GCGGTACGCG CCGCGGCCAC CCATCACACC CCGTCCGCGC GGATCAGCGG TGCCGCGGCG TTCACCGCCG CCACCCGCGG CGGCTGGCTG GCCGCCCGCG CCGGCGGTGA CGGTGCTGGA CGGATCACCG TCGGCGCGCC CGCGACCTTC GCGATCTGGG AGACCCCCCA CCCGCCGCGG CCGGCCAGGC CGCCAGCCGC CCAGCCCGCC GGGCCGCTGG ATGTTCTCCT TGACCAGCTT GACCGCACCG GCAGCGCGCC ACGCTGCCTG CGCACCGTGC TGCGCGGACA GACCCTGCAC GACCTGCTCT GA
Protein sequence | MHTCPPRTTG ERRPPGTVGT VSPHHLPAAR RVLYHHGHVH SPTHRHATAL LTDGPTITWI GTDDQPDASG PLPAGPVDHT VDLRGATLTA AFVDAHLHTT ATGLALDGVD LTDSPSLRHT LDQLTRAAHH RPGQPLLGTG WDETRWPEQR PPTSGELARA AGPVDVYLAR ADGHTAVISP HLATRSGAHH ATGWLGDGLC RDDAHHLART AAYHDLPPTT RRAAARRVRA HAATLGIAAL HEMAGPQVSS ADDLAALLTL ARDEPGPTIT GYWAGELTVA TALNTEPGLG PVGYGGDLFV DGSLGSHTAA LRSPYTDQPT HRGQLHRDAD DVRDTVLDAV AAGLQTGFHA IGDAALDTVL DGVRAATARV GTATISAGTH RVEHAELLHP EQIIAMARLG LVASVQPAFD ARYGGPDGLY TRRLGADRAS AMNPFAALHR AGVVLALSSD SPVTPLDPWG AVRAAATHHT PSARISGAAA FTAATRGGWL AARAGGDGAG RITVGAPATF AIWETPHPPR PARPPAAQPA GPLDVLLDQL DRTGSAPRCL RTVLRGQTLH DLL
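
The protein sequence can be derived from the gene sequence using translation table 11, where the GTG start codon is read as methionine. The sketch below is a minimal illustration under stated assumptions and not the tool that produced this record. The names `translate`, `gc_fraction`, `CODON_TABLE` and `START_CODONS` are hypothetical, the codon table is the standard NCBI layout in T, C, A, G order, and the input is assumed to contain only A, C, G and T plus whitespace.

```python
from itertools import product

# Codon-to-amino-acid mapping in NCBI order (bases T, C, A, G).
# Table 11 uses the same mapping as the standard code; it differs in start codons.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Start codons listed for NCBI translation table 11.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def clean(dna):
    """Remove the spaces that group the sequence into blocks of ten."""
    return "".join(dna.split()).upper()


def translate(dna):
    """Translate a CDS, reading a valid first codon as M and stopping at '*'."""
    seq = clean(dna)
    protein = []
    for i in range(0, len(seq) - 2, 3):
        codon = seq[i:i + 3]
        aa = CODON_TABLE[codon]
        if aa == "*":
            break
        if i == 0 and codon in START_CODONS:
            aa = "M"  # an alternative start codon still initiates with methionine
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of bases that are G or C."""
    seq = clean(dna)
    return (seq.count("G") + seq.count("C")) / len(seq)


# First three blocks of the gene sequence above.
prefix = "GTGCACACTT GCCCACCGCG GACCACAGGT"
print(translate(prefix))  # MHTCPPRTTG
```

Passing the full gene sequence to `translate` should reproduce the protein sequence listed above, and `gc_fraction` on the same input gives the value that the GC content field reports as a rounded percentage.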