Gene Information |
Locus tag | Francci3_2992 |
Symbol | |
ID | 3905489 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3542899 |
End bp | 3544368 |
Gene Length | 1470 bp |
Protein Length | 489 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 637880312 |
Product | amidohydrolase |
Protein accession | YP_482078 |
Protein GI | 86741678 |
COG category | [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG1228] Imidazolonepropionase and related amidohydrolases |
TIGRFAM ID | |
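The coordinate and length fields above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming that Start bp and End bp are 1-based inclusive positions and that the gene span includes a single terminal stop codon; the function names are illustrative and are not part of the source database.

```python
# Values copied from the record above.
START_BP = 3542899
END_BP = 3544368

# Assumption: the annotated span ends with one stop codon (3 nt)
# that is not represented in the protein sequence.
STOP_CODON_NT = 3


def gene_length(start: int, end: int) -> int:
    """Feature length for 1-based, inclusive coordinates."""
    return end - start + 1


def protein_length(gene_len: int) -> int:
    """Residue count for an unspliced CDS whose span includes the stop codon."""
    if gene_len % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return (gene_len - STOP_CODON_NT) // 3


length_bp = gene_length(START_BP, END_BP)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1470 489
```

Under these assumptions the computed values agree with the Gene Length (1470 bp) and Protein Length (489 aa) fields.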
|
|
Plasmid Coverage information |
Num covering plasmid clones | 22 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 0.278958 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACCGATC CCCCGGCATC ATCGGAACAA TCCCTGACTC CGGAGCAGCC TCCGGCGCAC GGGCGATCCA CCGCGACGGA ACAGTCCGAG TCCGGGCGTC CCGAGTCCGG GCGGTCGACG GTGCCCGGAC GATCAGAGGC GGCCGAGGCC TTCGTCCTGC GGGGGGTGCA CGTCCTCGAC AGGACCGGGC GCTTCGGTGA GCCGACCGAC GTCGCCGTCG GCGGCGGGAT CATCCATGCG GTCGGCACGC GACTCGGTCT TGACCGGGAC GCCGTCGACG TTGACGCCCA CGACCTGTGG CTGATGCCAG GGGTCGTCGA CTGCCACGCC CACGTCACGC AGTCCACCTA TGATCCGTTC GAGCTGGCGA CGGCGTCGCT GTCCACCCGG GTCCTGCAGA CCGCCGCCGC GTTGCGGGCG TCGTTGGCGG CCGGTGTCAC ACACCTGCGG GACGCCGGCG GCGCCGACGC CGGCATCCGG GACGCGGTCA GCGCGGGCAC CGTGCCCGGA CCTCGGCTTG CCGTGTCCGT CGTCGGGCTG AGCCGGTCCG GGCGGCACGG TGACGGGGCG ATCATCGGTC CCGGGCTGGA GTCGGCCGAA GATCTTCTCA TGCCGGACTA TCCCGGCCGG CCCCCGCATA CCGTTACCGA GGCGGGCAGC CTGCCGGCCT CGGTGCGGGG CATTCTGCGG GCGGGTGCGG ACTGGATCAT GATCTACGCG AGCGGCGGGG TGATGTCCGC CCGGCCGGGG CAGCCCGAGC CGCAGTTCAG CCCGGTGGAG CTTGCGGCCG CCGTCGCGGA GGCCCGGCGG TACGGACGTC CGGTGATGAT GCACGCCTTG GGCGAGCGGT CGATCGAGGC CGCGGTCGCA GCCGGAGCGC GTTCCATCGA ACACGGCATC GGGCTCACCG AGCCCGTGGC GGCCGCCATG GCGGCCGCCG GGGTGACGCT CGTCCCCACC CTGTCGCCCT ACCAGGACCT GGCGGCGCTG GCGGCCACCG GAGTGCTGCC GGGCTGGGCG GCGGACCGAG CCGAGGCCAC CGAGGCCGCG CTGGCCGGCA CGATCGCGGT CGCCCGTGCG GCGGGGGTGC CGATCGCGCT CGGCAGCGAC GCCCGGCACC GCACCCGGCA CGGCGCGAAC CTGGCCGAGA TCAGCCGGCT GCGCCATGCC GGGCTCACCC CACCCGAGGC GCTGCTCGCC GCGACCGCGA CCGGAGCCCG GCTCTTCGGA CTGGGTGAGG GAGCCGGCCG CATCGCCGTC GGCTCCGCCT TCGACGCGAT CCTGCTCGAC GCGGACCCCG GGGACCTGTC GATCTTCGAG CGGCCGGGCG CTGTGAGCGG AGTGTTCCTT GGCGGTCGCG CGGTGCTGCC CCACCCGCGG CTGCCCGGGA AGCTGGTCCG CCCCACCATG GTGATGGAAG AGGTGCCGAG AATCGTTCCC GAGCGGCCCG CGGTGTCGCC GCCCGGCTGA
|
Protein sequence | MTDPPASSEQ SLTPEQPPAH GRSTATEQSE SGRPESGRST VPGRSEAAEA FVLRGVHVLD RTGRFGEPTD VAVGGGIIHA VGTRLGLDRD AVDVDAHDLW LMPGVVDCHA HVTQSTYDPF ELATASLSTR VLQTAAALRA SLAAGVTHLR DAGGADAGIR DAVSAGTVPG PRLAVSVVGL SRSGRHGDGA IIGPGLESAE DLLMPDYPGR PPHTVTEAGS LPASVRGILR AGADWIMIYA SGGVMSARPG QPEPQFSPVE LAAAVAEARR YGRPVMMHAL GERSIEAAVA AGARSIEHGI GLTEPVAAAM AAAGVTLVPT LSPYQDLAAL AATGVLPGWA ADRAEATEAA LAGTIAVARA AGVPIALGSD ARHRTRHGAN LAEISRLRHA GLTPPEALLA ATATGARLFG LGEGAGRIAV GSAFDAILLD ADPGDLSIFE RPGAVSGVFL GGRAVLPHPR LPGKLVRPTM VMEEVPRIVP ERPAVSPPG
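The sequences in this record are displayed in space-separated blocks of 10 characters. As a minimal sketch of how such a grouped string might be normalized before analysis, the snippet below strips the spacing and computes a GC fraction. The helper names are hypothetical, and only the first two blocks of the gene sequence are used for illustration, so the result is not the whole-gene GC content.

```python
def clean_sequence(grouped: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(grouped.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# First two 10-base blocks of the gene sequence shown above (illustration only).
prefix = clean_sequence("ATGACCGATC CCCCGGCATC")
print(len(prefix), gc_fraction(prefix))
```

Passing the full Gene sequence field through the same two helpers would give a figure that can be compared with the reported GC content.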