Gene Information |
Locus tag | Francci3_2300 |
Symbol | |
ID | 3904834 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | + |
Start bp | 2682410 |
End bp | 2683447 |
Gene Length | 1038 bp |
Protein Length | 345 aa |
Translation table | 11 |
GC content | 76% |
IMG OID | 637879631 |
Product | pentapeptide repeat-containing protein |
Protein accession | YP_481397 |
Protein GI | 86740997 |
COG category | [S] Function unknown |
COG ID | [COG1357] Uncharacterized low-complexity proteins |
TIGRFAM ID | |
|
|
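The coordinate and length fields in the table above are related by simple arithmetic. The following is a minimal Python sketch of that consistency check. It assumes that the coordinates are 1-based and inclusive and that the reported protein length excludes the terminal stop codon; neither convention is stated in the record itself, and the variable names are illustrative only.

```python
# Values copied from the Gene Information table.
start_bp = 2682410
end_bp = 2683447

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length_bp = end_bp - start_bp + 1

# Assumption: the CDS is a whole number of codons and the final codon
# is a stop codon that is not counted in the protein length.
codon_count, leftover_bases = divmod(gene_length_bp, 3)
protein_length_aa = codon_count - 1

print(gene_length_bp)      # 1038
print(leftover_bases)      # 0
print(protein_length_aa)   # 345
```

Under these assumptions the computed values agree with the "Gene Length" (1038 bp) and "Protein Length" (345 aa) fields.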
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 0.000752712 |
Plasmid hitchhiking | No |
Plasmid clonability | decreased coverage |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.0184634 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | TTGCCCGACG ATCTCGATCC CGCCGCGGGA ATCACCGGGG ACGCCCGCGG CCGGCTGCGT GCCGACTGCG GGCGCTGTTT CGCGCTGTGC TGCGTGGCAC CGGCGTTCTC CGCCTCGGTG GACTTCGCGA TTGACAAACC CGCCGGCACG GCCTGCCCCA ACCTGAACGC GGGCTTCCGC TGTGCCGTCC ACGAGCAGCT GCGGCCGCGG GGGTTCCGCG GCTGCGCCGT GTATGACTGC TTCGGCGCCG GGCAGCAGGT CTGCCAGGTC ACCTACGGCG GGCGGGACTG GCGAGCGGCC CCGGACAGTG CCCGGCAGAT GTTCGATGTC TTCGCGGTCA TGCGGACGCT GCACGAGCTG CTGTGGTATC TGAGCGAGGC GCTGAGCCGG CCCGCCGCCG CCCCGCTGCA CGCCGAGCTG CATGACGCCC GCGCTCGCAC CATGCTGCTG ACCTGCGCCG ACGCCGACAC GCTGCTCGCC GTGGACGTCC CCGCGCTGCG TCAGGCGGTC CATCTGCTGT TGCTGCAGGC CAGCGACCTG GTCCGCGCCG AGCTGCCGGG CGGTGCCGGG AACAGGCGTC CCCGGCGGCG CCCCCGCGGC GCCGACCTGC TCGGCGCGAA CCTCGCCGGG GCGGATCTGC GCGGCGCGGC GCTGCAGGGA ACCCTCCTGA TCGGCGCGGA TCTGCGGGGG GCGGATCTGC GCGGCGCCGA CCTGCTCGGC GCTGACCTGC GCGACACCGA CCTGCACGGC GCCAACCTGC ACGGCGCCCT GTTCGTCGTT CAGGCCCAGC TCGACGCCGC CCGCGGGGAC GCCACCACGG CGCTGCCCCC GGCTCTGACC CGCCCCGTGC ACTGGACGCG CACGGCGGAC CCGACGCGGC CGGACCCCAG AACGCACCCC GTGGCGGGCT CGGGCCCCGG GCCCGGGACG GTGCCGATCC CCTCGTCGTC GCCCAGCGCG GCCCGGCCGC GGCGGTCGGC GAGTTCTCCT CGGTCGCCTC GTTCCCCTCG GTCGCCTCGT TCCCCACGCC GCCGCTGA
|
Protein sequence | MPDDLDPAAG ITGDARGRLR ADCGRCFALC CVAPAFSASV DFAIDKPAGT ACPNLNAGFR CAVHEQLRPR GFRGCAVYDC FGAGQQVCQV TYGGRDWRAA PDSARQMFDV FAVMRTLHEL LWYLSEALSR PAAAPLHAEL HDARARTMLL TCADADTLLA VDVPALRQAV HLLLLQASDL VRAELPGGAG NRRPRRRPRG ADLLGANLAG ADLRGAALQG TLLIGADLRG ADLRGADLLG ADLRDTDLHG ANLHGALFVV QAQLDAARGD ATTALPPALT RPVHWTRTAD PTRPDPRTHP VAGSGPGPGT VPIPSSSPSA ARPRRSASSP RSPRSPRSPR SPRRR
|
| |
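The gene sequence begins with TTG rather than ATG, yet the protein sequence begins with M. This is consistent with translation table 11, in which TTG can act as an initiation codon and is then read as methionine. The sketch below illustrates this on the first 30 bases and the last 9 bases of the gene sequence. It is a hand-rolled illustration, not the database's own method: `translate_cds` is a hypothetical helper, and `START_CODONS` lists only a subset of the initiation codons that table 11 permits.

```python
from itertools import product

# Amino acids for all 64 codons in T, C, A, G order (third base varies
# fastest). Table 11 assigns the same amino acids as the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}

# Assumption: only a subset of table 11 initiation codons is listed here.
START_CODONS = {"ATG", "GTG", "TTG"}


def translate_cds(dna: str) -> str:
    """Translate a coding sequence, reading a start codon in the first
    position as M and stopping at the first stop codon."""
    dna = dna.replace(" ", "").upper()
    usable = len(dna) - len(dna) % 3  # ignore a trailing partial codon
    protein = []
    for i in range(0, usable, 3):
        codon = dna[i:i + 3]
        if i == 0 and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as methionine
            continue
        aa = CODON_TABLE[codon]
        if aa == "*":
            break  # stop codon ends translation
        protein.append(aa)
    return "".join(protein)


# First three 10-base groups of the gene sequence in the record.
gene_prefix = "TTGCCCGACG ATCTCGATCC CGCCGCGGGA"
print(translate_cds(gene_prefix))  # MPDDLDPAAG

# Last nine bases of the gene sequence: two arginine codons, then TGA.
gene_suffix = "CGCCGCTGA"
print(translate_cds(gene_suffix))  # RR
```

The first result matches the first ten residues of the protein sequence in the record, and the second matches its final two residues, with the trailing TGA read as the stop codon.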