Gene Information |
Locus tag | Francci3_4320 |
Symbol | |
ID | 3907289 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 5161420 |
End bp | 5162955 |
Gene Length | 1536 bp |
Protein Length | 511 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 637881648 |
Product | hypothetical protein |
Protein accession | YP_483395 |
Protein GI | 86742995 |
COG category | [R] General function prediction only |
COG ID | [COG0312] Predicted Zn-dependent proteases and their inactivated homologs |
TIGRFAM ID | |
|
|
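The coordinate and length fields above are mutually consistent, and the relationship can be checked with a short sketch. It assumes 1-based inclusive coordinates and an unspliced CDS whose length includes the stop codon; both assumptions fit the values in this record but are not stated by it. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length in bp, assuming 1-based inclusive coordinates."""
    if end_bp < start_bp:
        raise ValueError("end_bp must not be smaller than start_bp")
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming the stop codon is included."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


# Values taken from the record above.
start_bp, end_bp = 5161420, 5162955
length_bp = gene_length(start_bp, end_bp)
print(length_bp)                  # 1536
print(protein_length(length_bp))  # 511
```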
Plasmid Coverage information |
Num covering plasmid clones | 30 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 14 |
Fosmid unclonability p-value | 1 |
Fosmid hitchhiking | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAGCCGGC CGGCCGGACC GCAACCAGCC TCAGCCGACG GAGGAAGCAC GGTGCGCGAG ACCGCCCATG AGATCGTCGA ACGCGCCCTG GGCCTGTCGA AGGCGGACGG CTGCGTAGTG ATCGCCACCG AGTCCAGCGC GGTGAACCTG CGCTGGGCCA ACAACACGTT GACGACCAAC GGCGCCAGCC GGGATCGGTC GATCACCGTC ATCAGCGTCA TCGGCCGCTC GTTCGGGGTG CGGACGGCCT CCACCATCGA CTCCCCCGGT TCCGGCGGCC ACCCCACCGC GCGTCCCACC GACCTGCCGC ATCCCACCGA TGCCGACGGC CTCGCGGAAC TCGTCCGCGC CGCCGAGGAC GCGGCCCGGG ACGCCGAGGA TGCCGAGGAC TACACGGACC TGCTCGGGCT GGACACGACG GACGTCGGCG CGGGTGCGGG GTCGGCGGGT GCGGGGTCGG CGGGTGACGC GGCCGAGGCC TTCACCGATC CGGCCGCCCG GACCAGCACC ACGGTGTTCG CCTCCTTCGC CCGGGACCTC GCGGAGGCGT TCGCCGCGGC GCGGGCGGGC GGGCGGCGGT TGTTCGGCTT CGCCGAGCAC AATCTGACCA CCACGTGGCT CGGGACGTCG ACGGGGCTGC GGCTGCGGTA CAGCCAGCCG ACCGGCAGCG TGGAGTGGAA CGCCAAGAGC GGCGTTCCCG GCGGCTCGGT CTGGCACGGC CAGTCGACCC GTGACTTCAC CGACGTCGAC GTCGCGGGTA CCGACGCGGC GCTGCGCGAC CGGCTGGCCT GGTGCGAACG GTCGCTGGAA CTGCCGGCCG GCCGGTACGA GACGCTGCTG CCGCCCTCCG CGGTGGCGGA TTTGATGATC TACATGTACT GGACGGCTGC CGGTCGGGAC GCGGCCGAGG GCCGGACGGT CTTCAGCCGG GCCGGCGGCG GCACGCGTCT CGGCGAGGCG ATCGGACCGG CGGGACTGCG GCTGGCCAGC GATCCGCACG ACCCGGAACT CGCGACGACC ACCTTCGTGA CCGCGCAGTC CTCCTCGTCG ATGTCCAGCG TCTTCGACAA CGGACTGGCG CTGAGGCCCA CCGACTGGAT CTCGGATGGC ACCCTCGCCG CCCTCGTGGA GACCCGTGCG TCCGCCCGTG CCACCGGGGT CCCGACCACC CCGATGATCG ACAATCTGAT CCTGGACGGC GGAGGCTCCG CGTCGTTGCA GGAGATGATC GCCTCGACGA AACGCGGGCT CCTGCTCACC AGCCTGTGGT ATATCCGCGA AGTCGATCCC GAGGTGCTCC TGCTCACCGG CCTCACCCGG GACGGGGTCT ACCTGGTGGA GAACGGTGAG GTCACCGGGG CCGTCAACAA CTTCCGCTTC AACGAGTCAC CGGTCGACCT GCTCGGCCGC CTGGCCGAGA TCGGCGCCAC CACCCGCACC ATGGCGCGGG AATGGGCGGA CTGGTTCACG CTCACCCGCA TGCCCGCGGT ACGGATCCCC GACTTCAACA TGTCCTCGGT CAGCCCGGCG AACTGA
|
Protein sequence | MSRPAGPQPA SADGGSTVRE TAHEIVERAL GLSKADGCVV IATESSAVNL RWANNTLTTN GASRDRSITV ISVIGRSFGV RTASTIDSPG SGGHPTARPT DLPHPTDADG LAELVRAAED AARDAEDAED YTDLLGLDTT DVGAGAGSAG AGSAGDAAEA FTDPAARTST TVFASFARDL AEAFAAARAG GRRLFGFAEH NLTTTWLGTS TGLRLRYSQP TGSVEWNAKS GVPGGSVWHG QSTRDFTDVD VAGTDAALRD RLAWCERSLE LPAGRYETLL PPSAVADLMI YMYWTAAGRD AAEGRTVFSR AGGGTRLGEA IGPAGLRLAS DPHDPELATT TFVTAQSSSS MSSVFDNGLA LRPTDWISDG TLAALVETRA SARATGVPTT PMIDNLILDG GGSASLQEMI ASTKRGLLLT SLWYIREVDP EVLLLTGLTR DGVYLVENGE VTGAVNNFRF NESPVDLLGR LAEIGATTRT MAREWADWFT LTRMPAVRIP DFNMSSVSPA N
|
| |
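The sequences above are printed in space-separated blocks of ten, so they need cleaning before use. The following is a minimal sketch, not the database's own tooling: it strips the whitespace, computes the GC fraction, and translates the CDS with the amino acid assignments of NCBI translation table 11. All function names are hypothetical. Table 11 also allows alternative start codons such as GTG and TTG, which this sketch does not treat specially; this gene starts with ATG, so that does not affect it. Ambiguous bases such as N are not handled and would raise a `KeyError`.

```python
BASES = "TCAG"
# Amino acid assignments for NCBI translation table 11, in TCAG codon order.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(raw: str) -> str:
    """Remove the block spacing and normalise to upper case."""
    return "".join(raw.split()).upper()


def gc_fraction(raw: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    seq = clean_sequence(raw)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def translate(raw: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    seq = clean_sequence(raw)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


# First three blocks of the gene sequence listed above.
prefix = "ATGAGCCGGC CGGCCGGACC GCAACCAGCC"
print(translate(prefix))  # MSRPAGPQPA
```

Passing the full gene sequence to `translate` and comparing the result with the cleaned protein sequence is a quick way to confirm that the two listings agree.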