Gene Information |
Locus tag | Franean1_5574 |
Symbol | |
ID | 5673902 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 6754895 |
End bp | 6756040 |
Gene Length | 1146 bp |
Protein Length | 381 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641244428 |
Product | acetamidase/formamidase |
Protein accession | YP_001509832 |
Protein GI | 158317324 |
COG category | [C] Energy production and conversion |
COG ID | [COG2421] Predicted acetamidase/formamidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 12 |
Plasmid unclonability p-value | 0.49831 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 21 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCTGCG ACGGCCGCCG TCGGCACGGA CGGTCCCGGG GCGGGCGGAA CAACGAGGAG TGCGACATGT CCGTCCTGCA GTGCGGGTCC GGCGAGGTAC CGGGGGAGCA CTACCTGCCC TCGTCGCCCA AGACGGTCAC CTGGGGCCGG CTGCCCAGCG CGGCGACCGA CCCGGTGCTC GAAGTGGCGG CGGGCGCCAC CGTCACCATC GACACCGTCT CGCACGAGGG CCTCATGGAG GACCAGGGCC GCGACCCGGT CGCCTTCTGG GCGGAGCACG CCGTCGCGGC GGATTCGGTC CTCCTCGACG CGATCGACAT CGCCCGGGAC GTCTCCCACT ACCGCGGGCT CGACGGCCCG CACGTGGTCA CCGGTCCGGT GCGGGTCGAC GGCGCGCGCC CCGGCGACAT CCTCAAGGTC GAGTTCCTCG AGCTGCGCCC GCGGGTGCCC TACGGGCTCG TCTCCAGCCG GCACGGCCGG GGCGCGCTGC CCGGCGAGCT TCCCGTCGGC CCCGACGGCG TGCTCGCCGA CCGTTACAGC CAGTTCTGCG AGGTGGACGT CGCCGCCGGG CGGGCCGTGA TGCGCTACGG CGAGGGCCGG CAGATCAGCT TCCCGGTCGC GCCGTTCATG GGTCTCACCG GCCTCACCCC GGCCGGGGAG AAGGCCCTCA ACACGACCCC GCCCGGGGCA TTCGGCGGGA ACCTCGACGT GCGCGACCTC GTCGCCGGGT CGACGCTCTA CCTGCCGGTC CAGATCCCGG GCGCGGGCTT CTACACCGGT GATCCGCATT TCGCCCAGGG GCACGGCGAG GTGTCGCTGA CCGCCCTGGA GGCCTCGCTG CGCACGACCG TCCGGCTCAC CCCGCTTCCG GCCGCCGCGG CGCTGCCCTT CGGCGCGGGT TCCGGCGGCC CCTTCGGCGA GACCCCGGAG CACTGGATCG CCATCGGCCT GCACAACGAC CTCGACGAGG CCATGCGGCT GGCCGTCCGC GAGGCGCTGC GGGTGCTGCG CCAGGTCCGG GATGTCCCGG TGATGGTCGC GTACAGCTAC CTGTCGGCCG CCGCCGACTT CGTGGTCAGC CAGGTCGTCG ACGACGTGAA GGGCGTGCAC TGCCTGATCC GCAAGCGCGA CTTCCCGGCC TGGTAG
|
Protein sequence | MSCDGRRRHG RSRGGRNNEE CDMSVLQCGS GEVPGEHYLP SSPKTVTWGR LPSAATDPVL EVAAGATVTI DTVSHEGLME DQGRDPVAFW AEHAVAADSV LLDAIDIARD VSHYRGLDGP HVVTGPVRVD GARPGDILKV EFLELRPRVP YGLVSSRHGR GALPGELPVG PDGVLADRYS QFCEVDVAAG RAVMRYGEGR QISFPVAPFM GLTGLTPAGE KALNTTPPGA FGGNLDVRDL VAGSTLYLPV QIPGAGFYTG DPHFAQGHGE VSLTALEASL RTTVRLTPLP AAAALPFGAG SGGPFGETPE HWIAIGLHND LDEAMRLAVR EALRVLRQVR DVPVMVAYSY LSAAADFVVS QVVDDVKGVH CLIRKRDFPA W
|
| |
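Note: the length fields in this record can be cross-checked against the coordinates and the sequences. The sketch below is illustrative only and is not part of the source database. It assumes that the start and end coordinates are 1-based and inclusive, and that the CDS ends in a stop codon that is not counted in the protein length. The helper names (`gene_length`, `protein_length`, `clean`, `gc_percent`) are hypothetical.

```python
# Hypothetical helpers for checking the record's internal consistency.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the final codon of the CDS is a stop codon.


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature with 1-based, inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of an unspliced CDS, excluding the stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


def clean(seq: str) -> str:
    """Drop the whitespace that splits the displayed sequence into blocks."""
    return "".join(seq.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    s = clean(seq)
    if not s:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in s) / len(s)


if __name__ == "__main__":
    # Values taken from the Gene Information fields of this record.
    print(gene_length(6754895, 6756040))  # 1146
    print(protein_length(1146))           # 381
```

To check the reported GC content, pass the full Gene sequence value to `gc_percent`. The function strips the block spacing itself, so the sequence can be pasted as displayed.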