Gene Information |
Locus tag | Francci3_2237 |
Symbol | |
ID | 3905005 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2608128 |
End bp | 2610044 |
Gene Length | 1917 bp |
Protein Length | 638 aa |
Translation table | 11 |
GC content | 68% |
IMG OID | 637879568 |
Product | hydantoinase/oxoprolinase |
Protein accession | YP_481334 |
Protein GI | 86740934 |
COG category | [E] Amino acid transport and metabolism; [Q] Secondary metabolites biosynthesis, transport and catabolism |
COG ID | [COG0145] N-methylhydantoinase A/acetone carboxylase, beta subunit |
TIGRFAM ID | |
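The coordinate and length fields above can be cross-checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the unspliced CDS ends in a stop codon that is not counted in the protein length. Both assumptions are consistent with the values in this record but are not stated by it. The function names are hypothetical, chosen for illustration.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Amino-acid count for an unspliced CDS, assuming the final codon is a stop."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the record above (Francci3_2237).
length_bp = gene_length(2608128, 2610044)
print(length_bp, protein_length(length_bp))  # prints "1917 638"
```

Both results match the Gene Length (1917 bp) and Protein Length (638 aa) fields.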
|
Plasmid Coverage information |
Num covering plasmid clones | 13 |
Plasmid unclonability p-value | 0.570918 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.102983 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGAATCC TGGTCAATAT CGACAACGGT GGCACGTTCA CCGATGTGTG CGTGACGGAC GGTGAGCGCA TCGTGCATGC GAAGACGCCG ACGACCCCGC ACGATTTAAC GCAGTGTTTC GTCGACGGGC TGCGGACAGC GTCCGACCGG CTGTATGGCG AGGAGGACAC CGCTCGTCTG TTGCGCGAGA CGGAGTACCT GCGCTATTCG ACGACGTCGG GTACGAACGC CGTGGTCGAG CGGAAGGGCG CACCGGTCGC CCTGCTGGTC GACAGCGGTG CGGAGGAGGA CGTCTACGGC ATTGCGAACC TGGTCGACGC CTCGCTGTGG CAGGCGCTGG TGCCGCATTC TCCGGTCGGG ATCACGGTGG GCGCCGACGG GTCGGTGGGC TTGGCGGAGT TCACGACGGC AATCAACGAA CTGTTGGCGA CGAGTACCTC GCGGATCGTG ATCGCGCTGC GGAGCGCGGC GGCGGAGCGG GCGATCAAGA ATCTGTTGTT GGAGCGGTAC CCGCGGCATC TGCTGGGCGC GGTTCCGTTC ACCCTGTCGC ATGAGCTTGT GCACGACGTC GACGACGCGC GGCGGGTGCT GACCGCGGTG CTGAACTCCT ACCTGCACCC GGGGATGGAG CACTTCCTCT ACGGCGCGGA GAAGGCGTGC CGGGAGAACG GGCTGCCTCG CCCGCTGCTG ATCTTCCGGA ATGACGGGGA CTCGGCCCGG GTGGCCAAGA CCACGGCGTT GAAGACGTGG GGTTCCGGGC CGCGGGGTGG GCTGGAGGGC AGCGTCGCCT ACGCCTCGCT CTACGGCGCG GACGTGCTGG TCGGGGTGGA CGTGGGTGGC ACCACGACCG ACGTGTCGGT CGTGGTGGAC AAGGCCCTGA CGGTGCACGC GCACGGCCGG GTCGATTCGG CGCAGACCTC GCTGCCGATT CCGGACCTGA GCAGTATCGG GCTGGGTGGC AGTTCGGTGG TCCAGGTCGT CAACGGTCAG ATTCAGATCG GTCCGCGCAG TGTGGGTGCC GCGCCCGGCC CGGCGTCTTT CGGCCGTGGT GGCACCGATG CGACGGTGAC CGATGCCCTG CTGCTTGCCG GGGTGTTGGA CCCGGACAAC TATCTGGGTG GGGATCTGAA GTTGGACCCG GCCCGGGCGG AGCGGGCGTT GCTGACCCAT GTCGGGGAGC CCCTGTCGTT GTCGGCGCAG GCCGCGGCGC TCGCGGTGTT GCGGGTGTTC GAGGAGCAGG CGGGGGCCGC GGTCAAGGAG ATGATCTCCG CGGCGGGTCG TGAGCCGGGT GAGGCCACGC TGCTGGCCTT CGGTGGCGCC GGTCCGGTGC TTGCCTCGGG GATCGCGCGG GCGGCGGGGA TCGCACGGGT GATCGTGCCG CATCTGTCGG CGGTGTTCAG CGCGTTCGGT ATCGGTTTCA GCGGGTTGGC ACACGAATAC AGCGTGCCGA TGCCGGGCGT CGACGTCGAG GTGAAGGCGG CCCGCGACGA TCTGTTGACC CGCGCGCGGC GTGACATGTT CGGTGAAGGC GTGTCCATCG ACGAATGCAC TGTCGAGACC CGCGGCCGAT TCATCGTCGA CGGCGTGTTG CGGGACGAGA CGTGGACCGA CGGGTCGTCT CCCGGGCAGG CCGATCAGCT GGTGGTGCGG GCCTGGTATC CGCTGCCGAC CTTCGAGCTG GTGGCCGACG AACACGGCAC GGTCCAGCCG GCGGCCGCGG ACGGATCGCG TCGTATCCAT TTCGCTGACG GTAACGAGCA GGAGATTGCG GTCTACCGTC CGGAGAATCT GGAGCCGGGG 
CAGGGGGCCG CGGGGCCGGC GCTGGTCGCG GGTGACTATC TGACCTGTCT GATCGAGCCC GGGTGGGGGT TCCGGGTGAG CAGCAACTCT GACCTGATCC TGGAGGCCCA GCAGTGA
Protein sequence | MGILVNIDNG GTFTDVCVTD GERIVHAKTP TTPHDLTQCF VDGLRTASDR LYGEEDTARL LRETEYLRYS TTSGTNAVVE RKGAPVALLV DSGAEEDVYG IANLVDASLW QALVPHSPVG ITVGADGSVG LAEFTTAINE LLATSTSRIV IALRSAAAER AIKNLLLERY PRHLLGAVPF TLSHELVHDV DDARRVLTAV LNSYLHPGME HFLYGAEKAC RENGLPRPLL IFRNDGDSAR VAKTTALKTW GSGPRGGLEG SVAYASLYGA DVLVGVDVGG TTTDVSVVVD KALTVHAHGR VDSAQTSLPI PDLSSIGLGG SSVVQVVNGQ IQIGPRSVGA APGPASFGRG GTDATVTDAL LLAGVLDPDN YLGGDLKLDP ARAERALLTH VGEPLSLSAQ AAALAVLRVF EEQAGAAVKE MISAAGREPG EATLLAFGGA GPVLASGIAR AAGIARVIVP HLSAVFSAFG IGFSGLAHEY SVPMPGVDVE VKAARDDLLT RARRDMFGEG VSIDECTVET RGRFIVDGVL RDETWTDGSS PGQADQLVVR AWYPLPTFEL VADEHGTVQP AAADGSRRIH FADGNEQEIA VYRPENLEPG QGAAGPALVA GDYLTCLIEP GWGFRVSSNS DLILEAQQ