Gene Information |
Locus tag | Franean1_6663 |
Symbol | |
ID | 5674978 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8092844 |
End bp | 8093845 |
Gene Length | 1002 bp |
Protein Length | 333 aa |
Translation table | 11 |
GC content | 75% |
IMG OID | 641245514 |
Product | Nitrilase/cyanide hydratase and apolipoprotein N-acyltransferase |
Protein accession | YP_001510906 |
Protein GI | 158318398 |
COG category | [R] General function prediction only |
COG ID | [COG0388] Predicted amidohydrolase |
TIGRFAM ID | |
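
The coordinate and length fields above are consistent with one another, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, which is what the listed gene length of 1002 bp implies, and that the final codon is a stop codon that does not count toward the protein length. The function names are hypothetical and are not part of the database.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length_aa(gene_len_bp: int, has_stop_codon: bool = True) -> int:
    """Residue count for an unspliced CDS of the given length."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = gene_len_bp // 3
    # Assumption: the terminal stop codon encodes no residue.
    return codons - 1 if has_stop_codon else codons


length = gene_length_bp(8092844, 8093845)
protein = protein_length_aa(length)
print(length, protein)  # 1002 333
```

Both values match the Gene Length and Protein Length fields of this record.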
Plasmid Coverage information |
Num covering plasmid clones | 5 |
Plasmid unclonability p-value | 0.00315162 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.834445 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGAGCGCGG ACCGCGGGGA CGGCGCGCGT GGGGGGCCGG AGCCGGCCTC CCACGCGGTG ACGATCGCCG CGGTGACCGC GCCGTTCACC CGTGACCTGG ACGAATGCCT CGCGGCGATC AGTCGGCTGG TCGACGGGGC CCGGCGGCGC GGGGTGGATC TCCTCGTCCT GCCCGAGGGG GCGCTCGGCG GCTACCTGCG GGCGCTGCCC CCGCGCGGGG ACGATCTGGC CCCGCGCGGG GGGCCGCCCG CCCTCGATCC GGACGGTCCG GAGATCACCC GGCTGGCGGC GATCGCCGGG GATATGGTGG TCTGCGCGGG ATACGCGGAG CGTGACGGGC GGTACCGGTA CAACAGTGCG GTGTGTGTGC ACGGTGATGG CGTCCTCGGC CGGCATCGCA AGGTCCACCA GCCGCTCGGC GAGTCCCTCG CCTACGAGGC CGGCCGTTCC TTCACCGCGT TCGACAGCCC ACTCGGCCGG ATGGGGATGA TGATCTGTTA CGACAAGGCG TTCCCCGAGT CCGGGCGTAG CCTGGCGCTC GCTGGCGCGG ACATCATCGC CTGCCTGTCG GCCTGGCCGG CCTCGCGTAC CCACGCGGCC GACGACATCG CCGCGGACAG GTGGCGGCAC CGCTTCGACC TCTACGACCA GGTGCGCGCA TTGGAGAACC AGGTCGTGTG GGTCTCGTCC AACCAGGCCG GCACGTTCGG CTCGCTGCGC TTCGTCGGCA ACGCGAAGAT CGTCCATCCG GACGGCTCCG TGCTCGCCTC GACCGGCACG GGCGCGGGGA TGGCCGTCGC GACGGTCGAC GTCACCGCGG CGCTACGGGC GGCCCGCAGC GGCCTGAACC ACCTGCGGGA CCGCCGCGGC GCGAGCTACG AGAAGCAGTG CCTGCTCGCC GGTAAGCCCT ACGACCCGCG GCGGGCCGCC CGCCCGGCGG CCCCCCGGCC CACGGCCGAG CACGGATCTC GCCCGTCCCG GCCCGCGGGA CCGGCGCACT GA
|
Protein sequence | MSADRGDGAR GGPEPASHAV TIAAVTAPFT RDLDECLAAI SRLVDGARRR GVDLLVLPEG ALGGYLRALP PRGDDLAPRG GPPALDPDGP EITRLAAIAG DMVVCAGYAE RDGRYRYNSA VCVHGDGVLG RHRKVHQPLG ESLAYEAGRS FTAFDSPLGR MGMMICYDKA FPESGRSLAL AGADIIACLS AWPASRTHAA DDIAADRWRH RFDLYDQVRA LENQVVWVSS NQAGTFGSLR FVGNAKIVHP DGSVLASTGT GAGMAVATVD VTAALRAARS GLNHLRDRRG ASYEKQCLLA GKPYDPRRAA RPAAPRPTAE HGSRPSRPAG PAH
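
The sequences above are printed in space-separated blocks of ten characters. The following is a minimal sketch of how such a record could be cleaned, checked for GC content, and translated. It makes several assumptions: the codon-to-residue mapping of translation table 11 is taken to be the same as the standard code, the first codon is read as methionine only when it is one of three common bacterial start codons (a simplification of the full table 11 start set), and a trailing stop codon is dropped. All function names are hypothetical.

```python
BASES = "TCAG"
# Standard codon table, codons ordered TTT, TTC, TTA, TTG, TCT, ...
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}
# Assumption: a simplified subset of the table 11 start codons.
START_CODONS = {"ATG", "GTG", "TTG"}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing used in the record and upper-case the result."""
    return "".join(raw.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean_sequence(dna)
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


def translate_cds(dna: str) -> str:
    """Translate an unspliced CDS, dropping a trailing stop codon if present."""
    dna = clean_sequence(dna)
    if not dna or len(dna) % 3 != 0:
        raise ValueError("CDS length must be a non-zero multiple of 3")
    codons = [dna[i:i + 3] for i in range(0, len(dna), 3)]
    residues = [CODON_TABLE[codon] for codon in codons]
    if codons[0] in START_CODONS:
        residues[0] = "M"  # an alternative start codon still initiates with Met
    protein = "".join(residues)
    return protein[:-1] if protein.endswith("*") else protein


# First three blocks of the gene sequence in this record.
print(translate_cds("GTGAGCGCGG ACCGCGGGGA CGGCGCGCGT"))  # MSADRGDGAR
```

The printed residues agree with the first block of the protein sequence above, where the GTG start codon is read as M. Passing the full gene sequence to `gc_fraction` would allow the listed GC content to be compared against the sequence itself.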