Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_7239 |
Symbol | |
ID | 5675540 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 8839011 |
End bp | 8840720 |
Gene Length | 1710 bp |
Protein Length | 569 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641246076 |
Product | peptidase S1 and S6 chymotrypsin/Hap |
Protein accession | YP_001511464 |
Protein GI | 158318956 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG0265] Trypsin-like serine proteases, typically periplasmic, contain C-terminal PDZ domain |
TIGRFAM ID | |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.108633 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 9 |
Fosmid unclonability p-value | 0.212332 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGACTACCA CGCACGACGC CCGGTCCCCT TGGGCTCCGC CCGACAACGC GCCGGAGCGG ACCAGGGCCG AGGCGAACAG TTCGGAGGGG TCGGTGGACC CGCGGTCGAA TCCCGTCGAA CACCAGCAGT ACCCCGGTAC GCCGCCGGCC GGCGGCCCGC CGCCCGGGCC CGGCCCGAGC CGTCCTGACC AGACCGGCGG TTTTCCGGCG GGCCCCCCGC CGGCGCCGCA CCCGGCCGGG CCGGCGCCCG GCACGGGCGC GCCCGGCGGG CCCTACGAGC CGGGCCCGCA GCAGGGCCCC CGGCCGACCG CGCCCTACGC CCAGACCTCC GGCTGGGCCG ACCCCCGCTC CACCGCCGGC GCTGGCGACA GCACGCAACG GGTCGACCAG GGCGGCCCGG CCACCGGCCA GGGCCAGCAG CAGTCCGGCG CGTGGTGGAA CCCCGGGCAT CCCGGCTGGG GCGCCCCGGT GCCGCCCGGA GCGGCTCCCG GCGGTCAGGC ACCCGGTGGC CAGGCGCCCG GGTCACCGGG CGGCTCCGAT CCCTACGGGC GGTTCACCCC GGCGAAGCCG AACCCGGCGC CCATGCGCCG GCGTCGGATG ATCGCGGCCG CGCTGGCGAT CGCGCTCGTG TCGGGCGGCA TCGGCGGCGG CGTCGGCGCA CTCGTCGCCA GCGACGACTC CCCGGCGGTG GTGACCTCCT CCGCCGGCCT GCGGCAGTCG ACCGGCACGG CCGGCGTCTC CCCGGCCGCG GACAACACGG TCGCCGCGGC CGCGCAGGCG ATCCTGCCCA GTGTCGTGAC GATCGCCGAA CAGTCCAGCC AGGAGTCGGG CACCGGCTCC GGGACGATCA TCCGCGCGGA CGGCTACATC CTCACGAACA ACCACGTGGT GTCCGGCGCC TCGCAGGGCG GCACCCTCAC GGTCACGATG CAGGACGGCC GGACCTTCGA CGCGCAGGTC AAGGCGACCG ACCCGAGCTC CGACCTCGCC GTGGTCAAGA TCGACGCTAC CGGCCTGCCC GCTGCCACGT TCGGGGACTC CGACGCACTG CAGGTCGGCG AGCTCGTCGT CGCTGTCGGC AGCCCGCTGG GCCTCAACGG GACGGTCACC TCCGGCATCG TCAGCTCGCT GCACCGGCCC GTCCGCACCG GCGACGCGAC CGTGCGGGAC CAGCAGAACA CCGTCCTGGA CGCGATCCAG ACCGACGCGC CCATCAATCC CGGCAACTCG GGTGGGCCGC TGGTGAACAG CAAGGGCGAG ATCATCGGCG TGAACACCGC AATCGCGACG GTCGGCGGCA GTTCACCCTT TGGTGGTAGT CAGCAGTCCG GAAACATCGG GGTTGGTTTC GCGATTCCGG GCAACTACGC GGAGAAGGTC GCCGGCCAGC TCGTCGACAA CGGCGCAGCG CAGCACCCCT ATCTGGGTGT GAGCGCCTCC ACCGCCGACG AGAACACCCG GTCGACGGCC GCGAGTGGGA CGGGCGCGCA GATCCGCTCC CTGGTCAGTG GAGGCCCGGC AGACAAAGCG GGCCTGCACG TAGGTGACGT CATCACCAAA GTGGGGGATC GCGCCGTCAC CGACGTGGAT TCGCTGATCG CCGCCGTCCG GTCCTACGAG ATCGGCAACC AGGTGCAGGT CACCTACCAG CGCGACGGCT CCAGCCAGAC CGCGACGGTC ACGCTGCTCG AACAACCGCC CAATTCCTGA
|
Protein sequence | MTTTHDARSP WAPPDNAPER TRAEANSSEG SVDPRSNPVE HQQYPGTPPA GGPPPGPGPS RPDQTGGFPA GPPPAPHPAG PAPGTGAPGG PYEPGPQQGP RPTAPYAQTS GWADPRSTAG AGDSTQRVDQ GGPATGQGQQ QSGAWWNPGH PGWGAPVPPG AAPGGQAPGG QAPGSPGGSD PYGRFTPAKP NPAPMRRRRM IAAALAIALV SGGIGGGVGA LVASDDSPAV VTSSAGLRQS TGTAGVSPAA DNTVAAAAQA ILPSVVTIAE QSSQESGTGS GTIIRADGYI LTNNHVVSGA SQGGTLTVTM QDGRTFDAQV KATDPSSDLA VVKIDATGLP AATFGDSDAL QVGELVVAVG SPLGLNGTVT SGIVSSLHRP VRTGDATVRD QQNTVLDAIQ TDAPINPGNS GGPLVNSKGE IIGVNTAIAT VGGSSPFGGS QQSGNIGVGF AIPGNYAEKV AGQLVDNGAA QHPYLGVSAS TADENTRSTA ASGTGAQIRS LVSGGPADKA GLHVGDVITK VGDRAVTDVD SLIAAVRSYE IGNQVQVTYQ RDGSSQTATV TLLEQPPNS
|
| |