Gene Information |
Locus tag | Franean1_3776 |
Symbol | |
ID | 5672141 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 4477008 |
End bp | 4478171 |
Gene Length | 1164 bp |
Protein Length | 387 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641242657 |
Product | nuclease SbcCD, D subunit |
Protein accession | YP_001508077 |
Protein GI | 158315569 |
COG category | [L] Replication, recombination and repair |
COG ID | [COG0420] DNA repair exonuclease |
TIGRFAM ID | [TIGR00619] exonuclease SbcD |
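
The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the CDS ends in a single stop codon that is not counted in the protein length. The function names are hypothetical and do not belong to any database API.

```python
# Values copied from the Gene Information table above.
START_BP = 4477008
END_BP = 4478171
GENE_LENGTH_BP = 1164
PROTEIN_LENGTH_AA = 387


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


computed_gene_length = gene_length(START_BP, END_BP)
computed_protein_length = protein_length(computed_gene_length)

# Both computed values agree with the table under the stated assumptions.
print(computed_gene_length == GENE_LENGTH_BP)        # True
print(computed_protein_length == PROTEIN_LENGTH_AA)  # True
```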
|
|
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.0285977 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 19 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAGTTCC TGCACACCTC GGACTGGCAT CTCGGGAAGA CCCTCAAGGG GCGCAACCGG CTCGACGAGC AGCGCGCCGT GCTCGGGGAG ATCATCGGGA TCGCCCGGAA GCACGAGGTC GACGCGGTGC TCGTCGCCGG GGACGTCTAC GACAGCGCGG CCCCGCCGGC GGCGGCCCAG CAGCTCGCCG TGCAGGCGCT GGTCGCGTTG CGCGGCACCG GCGCCGAGGT GGTCGTGATA GCCGGCAACC ACGACCACCA GCCGACCTTC GACGCCTACC GGCCGCTGAT GTCGGCCGCC GGGATCACCC TGGTCGGGAC GCCGCGCACC GCCGCCGACG GCGGGGTGGT CACCTTCCGC GCCCCCGGCA CCGGCGAACC GGTCACCGTG GCGGCGTTGC CGTTCGTCTC CCAGCGGTAC GCCGTCCGCG CCGCCGACCT GGTGACCCAG ACGCCCGACC AGACCGCGGC CCGCTACGAC CAGGTCGTGC GCGCCCTGAT GGAACAGTTG CGCTCCGGCT TCGACGACAA CGCGGTGAAC CTCGTCCTGG CCCACCTGAC GGTGACGAAC GGCCTGTTCG GCGGCGGGGA GCGGATGGCC CAGTCGATCT TCGAGTACCA CGTGTCCGCG GCGGCGTTCC CCGCGGACGC CCACTACGTG GCGCTGGGGC ACCTGCACCG GCGGCAGACC CTCGCCGCGC CGTGCCCGGT GGTGTACTCC GGCGCGCCGC TCGCGGTCGA CTTCGGCGAG CAGGAGAACA CGAACGTGGT GTGCCTCGTC GAGGCGAGCC CCGGCGTCCC CGCCCAGGTC ACCGACATCG CGCTGACGGC CGGCCGGCGG CTGCGGACGG TCCGGGGCAC CGTCGCGGAG CTTGCCGCGC AGGCCCCGGA GCTCGCCGAG GACCTGCTGC GGGTGGTCGT GCGCGAACCG GCCCGCGCGG GGCTGCGCGA CGAGGTCCAG GAGCTGCTGC CGAACGCGCT CGAGGTCGGG ATCGACCCGG AGCTCCGGGT GGCGCTCGGC GGGTCGCGCC CGAGCGCGGC GGCGCTGGCG CGGCGCAGCC CGGAGGACCT GTTCCGGGAG TTCTGCGCCG CCGAGCAGTT CGAGGATCCC CGGGTCGAGG CGCTGTTCGC CCGGCTGCAC GACGACGTCT CCAGCGGCGA CTGA
|
Protein sequence | MKFLHTSDWH LGKTLKGRNR LDEQRAVLGE IIGIARKHEV DAVLVAGDVY DSAAPPAAAQ QLAVQALVAL RGTGAEVVVI AGNHDHQPTF DAYRPLMSAA GITLVGTPRT AADGGVVTFR APGTGEPVTV AALPFVSQRY AVRAADLVTQ TPDQTAARYD QVVRALMEQL RSGFDDNAVN LVLAHLTVTN GLFGGGERMA QSIFEYHVSA AAFPADAHYV ALGHLHRRQT LAAPCPVVYS GAPLAVDFGE QENTNVVCLV EASPGVPAQV TDIALTAGRR LRTVRGTVAE LAAQAPELAE DLLRVVVREP ARAGLRDEVQ ELLPNALEVG IDPELRVALG GSRPSAAALA RRSPEDLFRE FCAAEQFEDP RVEALFARLH DDVSSGD
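
The sequences above are printed in space-separated blocks of ten characters. As a rough sketch of how such a field might be processed, the code below strips the grouping, computes GC content, and translates a CDS. The helper names are hypothetical. The codon assignments are the standard ones, which NCBI translation table 11 shares with table 1 (the two tables differ only in their permitted start codons). Only the first two blocks of the gene sequence are used here; running the helpers on the full record is left to the reader.

```python
# Standard amino-acid assignments, with codon bases ordered T, C, A, G.
# Translation table 11 uses these same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    b1 + b2 + b3: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, b1 in enumerate(BASES)
    for j, b2 in enumerate(BASES)
    for k, b3 in enumerate(BASES)
}


def clean_sequence(grouped: str) -> str:
    """Remove the whitespace between blocks and upper-case the result."""
    return "".join(grouped.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in an ungrouped nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


def translate(cds: str) -> str:
    """Translate complete codons from position 0, stopping at the first stop."""
    protein = []
    for pos in range(0, len(cds) - len(cds) % 3, 3):
        residue = CODON_TABLE[cds[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


# First two blocks of the gene sequence field above.
prefix = clean_sequence("ATGAAGTTCC TGCACACCTC")
prefix_gc = gc_percent(prefix)
prefix_protein = translate(prefix)

print(prefix_protein)  # MKFLHT, matching the start of the protein sequence
print(prefix_gc)       # 50.0 for this 20-base prefix only
```

The GC value printed here describes the 20-base prefix, not the whole gene; the 74% figure in the table refers to the full 1164 bp sequence.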