Gene Information |
Locus tag | Francci3_2509 |
Symbol | |
ID | 3904653 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 2962483 |
End bp | 2964396 |
Gene Length | 1914 bp |
Protein Length | 637 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637879839 |
Product | hydantoinase/oxoprolinase |
Protein accession | YP_481605 |
Protein GI | 86741205 |
COG category | [E] Amino acid transport and metabolism [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit |
TIGRFAM ID | |
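
The coordinate and length fields in this table can be cross-checked with a little arithmetic. The sketch below assumes the Start bp and End bp values are 1-based and inclusive (an assumption that is consistent with the listed gene length) and that the final codon is a stop codon contributing no residue. The variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2962483
end_bp = 2964396
gene_length_bp = 1914
protein_length_aa = 637

# Assumption: coordinates are 1-based and inclusive, so the span is end - start + 1.
span = end_bp - start_bp + 1

# A complete CDS is a whole number of codons; the last codon is assumed to be
# the stop codon, which does not add an amino acid.
codons, remainder = divmod(span, 3)
residues = codons - 1

print(span, codons, remainder, residues)  # 1914 638 0 637
```

Under those assumptions the span matches the Gene Length field and the residue count matches the Protein Length field.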
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.212275 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 18 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACACCC TGATCAACAT CGACAACGGT GGCACGCTTA CCGACATCTG CGTCTGGGAT GGCGACCAGT TCACCTACAC CAAGTCCCTC ACTACTCCGC ACGACCTGTC GGAGTGCCTC TTCGACGGGA TCGAGAAGGC CTCGGTCGCC CTCTACGGCG AAGCCAACAC GGAGAAGCTG CTGCACGCTA CGCAGCACAT CCGATACTCG ACCACCCAGG GCACCAACGC CCTGGTCGAG CGGCGGGGCC CGATGATCGG CATCCTCACC ACGATGCCGG GCCTGTTCGA GCGGATGCGC GGCGGTGAAG CCGAGAAGGA CCTCTTCGAC GGCCTGATCG CCGACCGGAT GCTCACCATC GACATGGGCG CGACCGATGA GGAGATCGAC TTCGAAGTCG TCCAGCGGAT CAACCAGCTC ACCACCCTGG GCGCCGCCCG GGTGGTGGTG GCTGGTGAGT CGCCGGAACA GGAGCGTCGC CTACGCAGCG TGTTGCTGCG CAAGTTCCCG CGGCACCTGC TCGGCTCGAT CCCGCTCCTG TACTCGTGGT CCCTCGCCGG CGACCGGGAC CACCCGCGTC GGGTGTGGTC GTGCGTGCTC AACTCGTTCC TGCACCCGAC CATGGAGCGG TTCCTGTACG GGGCGGAGCG GCGGCTGAAG TCGTACAGGG TGCTCAACCC GCTGCTGGTC TACCGCAACG ACGGAGCCTC CTCGCGGGTG GCCAAGGCGG TGGCGCTGAA GACGTACTCC TCCGGGCCGC GCGGCGGCTT GGAAGGTACG GCGGCGCTGG CTCGCACGTA CGGCCTCGAT CACACGCTGA TGATGGACAT CGGCGGCACC ACCACCGACG TGGGGGTCGT CCGCGGCGGC GCGGCGGCGG CCGATGAGCG CGGCACCATC GAGGGCGTCC CGATCTCCTA CCCGATGAGC AACGTGCACT CGACCGGGGT CGGTGGGTCC TCGGTGATCT CGGTGGTGGA CGGGCAGATC ATGGTCGGGC CGCGCAGTGT GGGGGCGGCC CCCGGCCCCG CCTGCTTCGG CTTCGGTGGC AAGGAAGCCA CGATCACCGA CGTCAACCTG CTGCTCGGCG TCCTCGACGC CAGCACCTAC CTCGACGGCA CCTTCCGGCT CGACGCCGAC CGTTCGGCCG CGGTCATCAC CGAGACCATC GCCGAGCCGC TGGGCATCAG CCTGGAGGAG TCGCTGATCC GGATGGAGCG GGCCTACTTC GAGGCGCTGG CACACTCCTT CGCGCACCTG ATCGAGGAGA ACTCGACCCT CATCGCCTTC GGTGGCGCCG GCCCGATGAG CGCCGTTGGT GCCGCTCGTC CGGCAGGGGT GAAGAAGGTG CTGATCCCGC GGATGGCGGC GGTCTTCTCC GCGTTCGGCA TTGCCTTCTC CGACATCGGC AAGACCTACG AGGTCGGCGT GCCGGAGCCG ACCACGGCAA GCACCGCGGC GACGTACGAC GAGATGCTTG CCCGGGCCAG GCGCGACATT TTCCAGGAGG GCTACGACCT CGACGACTGC CGCACCGAGG TACTGCTCAC CATCGAGGAG ACCGACGGGT CGCCGGTGGA GACCAGGCCG TACCAGTCCG GCGACGCCGC GGACTTCCCC GGGAAGCAGG TCTCCCTGCA ACTGTCGGTG ACGGCCGCGC TGCCGCACCC CGACGTTGCT CCCGACACCG ACGTGCCCGC GATCCGGGTG ACGAGCAATG AGACCCGCCT GGTCCGTTCC GCGCCCGACC AGGTCGACAA GGTGCCGGTG TTCGTGCTCG CCGAGATGCC GCCCGGTGGA 
AGTGGCGAGG GCCCGGTGAT CGTCGAGGGC CCGTTCTTCA CCGCCCGCGT GCTGCCCGGC TGGCAGTTCC GGGTCACCGC CTCGGGGGAC CTGCTGCTGA CCGACACCCA CTGA
|
Protein sequence | MDTLINIDNG GTLTDICVWD GDQFTYTKSL TTPHDLSECL FDGIEKASVA LYGEANTEKL LHATQHIRYS TTQGTNALVE RRGPMIGILT TMPGLFERMR GGEAEKDLFD GLIADRMLTI DMGATDEEID FEVVQRINQL TTLGAARVVV AGESPEQERR LRSVLLRKFP RHLLGSIPLL YSWSLAGDRD HPRRVWSCVL NSFLHPTMER FLYGAERRLK SYRVLNPLLV YRNDGASSRV AKAVALKTYS SGPRGGLEGT AALARTYGLD HTLMMDIGGT TTDVGVVRGG AAAADERGTI EGVPISYPMS NVHSTGVGGS SVISVVDGQI MVGPRSVGAA PGPACFGFGG KEATITDVNL LLGVLDASTY LDGTFRLDAD RSAAVITETI AEPLGISLEE SLIRMERAYF EALAHSFAHL IEENSTLIAF GGAGPMSAVG AARPAGVKKV LIPRMAAVFS AFGIAFSDIG KTYEVGVPEP TTASTAATYD EMLARARRDI FQEGYDLDDC RTEVLLTIEE TDGSPVETRP YQSGDAADFP GKQVSLQLSV TAALPHPDVA PDTDVPAIRV TSNETRLVRS APDQVDKVPV FVLAEMPPGG SGEGPVIVEG PFFTARVLPG WQFRVTASGD LLLTDTH
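
The two sequence fields are related by translation with the table listed under Translation table (11). A minimal sketch of that relationship follows, using only the Python standard library. The helper names `clean`, `gc_content`, and `translate` are hypothetical, and the codon table is the standard amino-acid assignment, which table 11 shares. The sketch does not handle alternative start codons such as GTG or TTG; that does not matter here because the gene starts with ATG. Because the Gene sequence field already begins with the start codon, it is treated as the coding strand and is not reverse-complemented, even though the Strand field is "-".

```python
from itertools import product

# Standard amino-acid assignments in TCAG codon order (shared by table 11).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the 10-character grouping spaces and line breaks."""
    return "".join(seq.split()).upper()


def gc_content(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean(seq)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean(seq)
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# Excerpts copied from the Gene sequence field: first 30 and last 24 bases.
first_30 = "ATGGACACCC TGATCAACAT CGACAACGGT"
last_24 = "CTGCTGCTGA CCGACACCCA CTGA"

print(translate(first_30))  # MDTLINIDNG
print(translate(last_24))   # LLLTDTH
```

The two excerpts translate to the first ten and last seven residues of the Protein sequence field. Passing the full Gene sequence to `gc_content` gives a value that can be compared with the listed GC content, with the caveat that the database may compute that figure differently.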