Gene Information |
Locus tag | Francci3_2191 |
Symbol | |
ID | 3906791 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 2564221 |
End bp | 2565186 |
Gene Length | 966 bp |
Protein Length | 321 aa |
Translation table | 11 |
GC content | 66% |
IMG OID | 637879523 |
Product | prolyl aminopeptidase |
Protein accession | YP_481289 |
Protein GI | 86740889 |
COG category | [R] General function prediction only |
COG ID | [COG0596] Predicted hydrolases or acyltransferases (alpha/beta hydrolase superfamily) |
TIGRFAM ID | [TIGR01249] proline iminopeptidase, Neisseria-type subfamily |
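
The coordinate and length fields above can be cross-checked against one another. The sketch below is a minimal illustration, not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the record above.
start_bp = 2564221
end_bp = 2565186
gene_length_bp = 966
protein_length_aa = 321


def span_length(start: int, end: int) -> int:
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Number of codons in the CDS minus one terminal stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = span_length(start_bp, end_bp)
computed_protein_length = expected_protein_length(computed_gene_length)

# Compare the computed values with the fields reported in the record.
print(computed_gene_length == gene_length_bp)
print(computed_protein_length == protein_length_aa)
```

Under these assumptions the reported Gene Length and Protein Length agree with the coordinates. The Strand value does not affect the length arithmetic.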
|
|
Plasmid Coverage information |
Num covering plasmid clones | 19 |
Plasmid unclonability p-value | 0.73475 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGGGCAGC CGTACCGGCC TATCGAGCCA TACGCGCACG GCTTCCTCGA TGTAGGCGAG GACAACCAGA TCTATTGGGA AACCAGCGGC AACCCAGACG GCAAGCCGGC GTTGTGGGTG CACGGAGGTC CCGGCAGCGG CGGGCGCCGC GGCAGCCGCA GGATGTTTGA TCCGGACGTC TACCGGATCA TCCTGTTCGA CCAGCGTGGC TGCGGCCAGA GCCGACCGCA CGCCAGCGAC GCGACCGTCA GCCTGGAACA CAACACCACC CGGCACCTGA TCGCCGACAT GGAGAGGCTC CGCGAACATC TCGGCATCGA CCGGTGGCTC TTTTACGGCA GCTCATGGGG CTCGACGCTG ATACTCGCCT ACGCCGAGCG CTACCCTGAA CGGGTCTCCG AGATCATCAT CGTCGGAGTC ACCATGACCC GGCCCGAAGA AATCGACTGG CTCTACCGCG GCGTCGGGCG CCTGCTGCCA GCCGCCTGGG AAGCGTTCCG CGACGCCGTG CCGAAGGACG ACTGGGACGG CAGTCTCGTT GCTGCCTACA ACCGGCTGTT GGGCAGCCCC GACGAGGCGA TAAGAATCAC CGCGGCGCGG GCCTGGTGCG CCTGGGAAGA CGCCGTCATC GCGCACGAAG CGCTTGGGGC GCCAGGCCAG TACAGCAGCA AACCCGACGA TGCGCTCGTG GCCTTCGTCC GCATCTGCAC GCACTACTTT GCAAACAACG CCTGGCTGGA GGACGGGCAG TTGCTACGCG ACGCTCACCG GCTGGCCGGG ATTCCCTCGG TGTTGATCCA CGGCCGACTT GACCTTGCCA GCCCGCTCAA GACCGCCTGG GATCTCGCCA AGGCATGGCC CGACGCGGAA CTGAAGATCA TCGACAATGC AGGCCACACG GGCAGCCCCG CGACCCAGAA CGCCATCATC GAGGCGACCG AACGGTTCAG CACGAACGGA CACTGA
|
Protein sequence | MGQPYRPIEP YAHGFLDVGE DNQIYWETSG NPDGKPALWV HGGPGSGGRR GSRRMFDPDV YRIILFDQRG CGQSRPHASD ATVSLEHNTT RHLIADMERL REHLGIDRWL FYGSSWGSTL ILAYAERYPE RVSEIIIVGV TMTRPEEIDW LYRGVGRLLP AAWEAFRDAV PKDDWDGSLV AAYNRLLGSP DEAIRITAAR AWCAWEDAVI AHEALGAPGQ YSSKPDDALV AFVRICTHYF ANNAWLEDGQ LLRDAHRLAG IPSVLIHGRL DLASPLKTAW DLAKAWPDAE LKIIDNAGHT GSPATQNAII EATERFSTNG H
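
The sequences above are displayed in space-separated blocks of ten characters, so they need to be normalized before any downstream use. The following is a minimal sketch of that step, with a few basic sanity checks on a CDS. The helper names are hypothetical. The stop codon set is the one used by translation table 11 (TAA, TAG, TGA). Because table 11 allows alternative start codons, a CDS that does not begin with ATG is not necessarily invalid.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}  # stop codons in translation table 11


def normalize(seq_field: str) -> str:
    """Remove the display whitespace between blocks and upper-case the result."""
    return "".join(seq_field.split()).upper()


def summarize_cds(seq_field: str) -> dict:
    """Return basic properties of a CDS given in the blocked display format."""
    seq = normalize(seq_field)
    gc_count = seq.count("G") + seq.count("C")
    return {
        "length": len(seq),
        "starts_with_atg": seq.startswith("ATG"),
        "ends_with_stop": seq[-3:] in STOP_CODONS,
        "gc_fraction": gc_count / len(seq) if seq else 0.0,
    }


# Illustration only: the first and last display blocks of the gene sequence,
# not the full 966 bp sequence.
summary = summarize_cds("ATGGGGCAGC CACTGA")
print(summary)
```

Passing the full Gene sequence field to `summarize_cds` gives values that can be compared with the Gene Length and GC content fields of the record.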