Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_1246 |
Symbol | |
ID | 5669659 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | - |
Start bp | 1499491 |
End bp | 1501374 |
Gene Length | 1884 bp |
Protein Length | 627 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641240178 |
Product | Xaa-Pro aminopeptidase |
Protein accession | YP_001505606 |
Protein GI | 158313098 |
COG category | [E] Amino acid transport and metabolism |
COG ID | [COG0006] Xaa-Pro aminopeptidase |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 10 |
Plasmid unclonability p-value | 0.623221 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 5 |
Fosmid unclonability p-value | 0.00858649 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGAGAAGTC CCGCCATCGG GCCGGCCGTG GTCGCCACGG CGAAGGACGC TACCCGAGCA CGGCGGAAGC GGCCTACCGT TAGCGGCATG AGTGCCCAGT CCCACCAAAC ACCAGCTTCA CGCCCGGTCG CGGGGCGATC CCCCGGCGGT ACGGCCGACG ACGTCGCACA CCGGACCACT CCCGCTTCGG GCGGCGCGGA AGTGGAACGC GTCGCATCTT CGCCGGACAC CGGCACACCG GGGCTCCCGG TGCCCCCCGC CGTACCACCG GGGAAATCGA TCCAGCCGGA TCTGGAATCT CCGGAACCGA CCATCGGGAC AACCCGGGCA CAGGAGCCGG CGGGGGTAAC CATCGCACCG GGGACTCCCC CCACCGCCGC AGACAGTCCG ACGACGGAGG AACAAGCCGC GGACGAACCG GGCAGGACCG GACAGGCTAA GCACGACGAG GAACCCCACG CGGCCTTCAG GCGTTTCATG GCGAGCGGCT GGGCGCCGGT GGATGATATC GTCGCGGTGC GTGACGACTG CGCCCCTTAC ACCACGAAAC GGCGCTCGCT TCTGGCCACC CGATTCCCCA CCGAAGCGCT TGTCATACCA AGCGGCGGGC TACACGTCCG GGCGAACGAC ACCGATTACC CGTTCCGCCC CGGCAGTGAC TTCTTCTGGC TCACCGGATG TCACGAACCG GACGCCGTCC TGATCCTGCA TCCCACCGCT GCCGGCGACC ATGACGCCGT GCTCTATCTC GCTGACCGAT CCGACCGATC GAGTTCCGCG TTCTACACGG ACCGCCGTTA CGGCGAACTG TGGGTGGGCC CCCGGCCCGG TGTACGGGAG ACCACAGCGG CTCTCGACAT CGAATGCCGG CCGCTGCCGG AGCTTCCCGA AGCACTGGCC CGTCTCGCGC CCGCCAGGAC CCGCGTCGTG CGCGGGCTGG ACGCCCGGGT GGACCGCGCG GTGAGCCGGT GGTCGCCGAC CGGCTCGTCC GCCGACCGGG ACGCCGCGCT GGCCGAGGTG CTGTCCGAGC TCCGGCTGGT CAAGGACGAC TTCGAGATCG CCCGGCTGGA CGAGGCGGTC GCGGCCACCG TGCTCGGTTT CACCGAGTGC GTGGGCGAGC TCGGCCGCGC GGCGACGCTC CCCAACGGGG AGCGCTGGCT GGAGGGAACC TTCTGGCGAC GGGCCCGCGT CGACGGCAAC GACGTCGGAT ACGGCTCCAT CGTGGCCTGC GGCCCGCACG CCACGACCCT GCACTGGGTG CGCGACGACG GCCCAGTGCG GCCCGGTGAC CTGGCACTTC TCGACATGGG GGTCGAGGGC CGCTCGCTGT ACACCGCCGA TGTGACGCGG ACGCTGCCGG TGAGCGGGCG CTTCAGCCCG CTGCAGCGCC AGGTTCACGA GGTCGTCTAC CGGGCCCAGC AGGCCGGGAT AGACGCGGTC CGTCCCGGCG CCGCCTTCCT GGATCCGCAC CGGGCCGCGA TGCGGGTGAT CGCGCAGGCG CTGCACGACT GGGGATTGCT GCCGGCCACG GTCGAAGAGT CGCTCAGCGA GGACCCGAAA GCACCCGGGG CCGGCCTGCA CCGGCGCTAC ACACTGCACT CCACGTCGCA CATGCTCGGG CTGGACGTGC ACGACTGCGC GCAGGCGCGC GACGAGACCT ACCGCGACGC CGCGCTGGAG GCCGGGATGG TGCTGACGGT CGAACCAGGG CTGTACTTCC AGCCGGACGA CCTCACTGTT CCACCGGAGC TGCGCGGGAT CGGCGTCCGG ATAGAGGACG ACATCCTGGT CACGCCGGAT GGAAGTCGGA ACATGTCAGC CGCGCTCGCA CGCTCGGCCG ACGATGTCGA AAAGTGGATG GCCGGCGAGG CCGCCAGGCA CTGA
|
Protein sequence | MRSPAIGPAV VATAKDATRA RRKRPTVSGM SAQSHQTPAS RPVAGRSPGG TADDVAHRTT PASGGAEVER VASSPDTGTP GLPVPPAVPP GKSIQPDLES PEPTIGTTRA QEPAGVTIAP GTPPTAADSP TTEEQAADEP GRTGQAKHDE EPHAAFRRFM ASGWAPVDDI VAVRDDCAPY TTKRRSLLAT RFPTEALVIP SGGLHVRAND TDYPFRPGSD FFWLTGCHEP DAVLILHPTA AGDHDAVLYL ADRSDRSSSA FYTDRRYGEL WVGPRPGVRE TTAALDIECR PLPELPEALA RLAPARTRVV RGLDARVDRA VSRWSPTGSS ADRDAALAEV LSELRLVKDD FEIARLDEAV AATVLGFTEC VGELGRAATL PNGERWLEGT FWRRARVDGN DVGYGSIVAC GPHATTLHWV RDDGPVRPGD LALLDMGVEG RSLYTADVTR TLPVSGRFSP LQRQVHEVVY RAQQAGIDAV RPGAAFLDPH RAAMRVIAQA LHDWGLLPAT VEESLSEDPK APGAGLHRRY TLHSTSHMLG LDVHDCAQAR DETYRDAALE AGMVLTVEPG LYFQPDDLTV PPELRGIGVR IEDDILVTPD GSRNMSAALA RSADDVEKWM AGEAARH
|
| |