Gene Information |
Locus tag | Francci3_2904 |
Symbol | |
ID | 3903968 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. CcI3 |
Kingdom | Bacteria |
Replicon accession | NC_007777 |
Strand | - |
Start bp | 3419150 |
End bp | 3420511 |
Gene Length | 1362 bp |
Protein Length | 453 aa |
Translation table | 11 |
GC content | 69% |
IMG OID | 637880225 |
Product | hypothetical protein |
Protein accession | YP_481991 |
Protein GI | 86741591 |
COG category | [S] Function unknown; [T] Signal transduction mechanisms |
COG ID | [COG2013] Uncharacterized conserved protein; [COG2310] Uncharacterized proteins involved in stress response, homologs of TerZ and putative cAMP-binding protein CABP1 |
TIGRFAM ID | |
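
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below is only an illustration: the function names are hypothetical, and it assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length counts one terminal stop codon, which is consistent with the values in this record but is not stated by it.

```python
def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Residue count, assuming the CDS length includes one stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    # Every codon encodes one residue except the final stop codon.
    return cds_length_bp // 3 - 1


length = gene_length_bp(3419150, 3420511)
print(length)                     # 1362
print(protein_length_aa(length))  # 453
```

Both results match the Gene Length (1362 bp) and Protein Length (453 aa) rows. Because the gene lies on the minus strand, the listed gene sequence would correspond to the reverse complement of this coordinate range on the replicon.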
|
|
Plasmid Coverage information |
Num covering plasmid clones | 15 |
Plasmid unclonability p-value | 0.48904 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 0.788962 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | GTGTTGGTCA CCCAGTTCAG TCGAGGGCAG AAGGCGCAGC TGTCGACGAT CACCCCGGGC ACCGACCTGT ACGTCGGGAT CCAGATCAAT GCGCCGGGGG AGTGGGACAT CTCCTGCTTC GGCCTGGACG GCGCCGATCG CCTCACCGAC GACCGCTACT TCGTCTTCTT CAACCAGCCG GCGTCGCCGG AGAAGTCCAT CCAGCTCCTC GGGCCGCAGG CCGGCGACAG TCAGGCGTTC CGGGTCACCC TCGACCAGGT GCCCGCCTCG ATCGGCCGGC TGTCGTTCTG CGCCGCCCTC GACGGTGCCG GTAGCGCCTC GGCTATCTCG TCGGGGTACC TGCGGGTCGT CGTCGCGGGA ACCGAGGTGC TGCGCTACTC CTTCACCGGC GCCGACTTCA CCAGCGAGCG CGCCGTGATG ATCGCCGACA TCTACCGCAA GGGTGTGTGG CGGATCGCCG CCGTGGGCCA GGGGTTCCAG GGGGGACTGG CCGAGCTCAT CCGCTCTTAC GGCGGTGAGG TCGCCGACGA GGCGCCGCAG CCGGCCCCCG CTCCCGGTTT CGGCGCTCCC GGTTTCGGCG CTCCGCTCGC TGCACCGCCG CCCATGGGCT TCGCGGGTGG GATGCCCGCC ACCGGCATGC CGGGTGGGAT GCCGCCGACC GGCGCGTTCG GCGGACCCGA GATCCTGCCG TCGCAGGCGC GACCCACCCT GCCCGGGGCG ATGAACAGCC TGGATCCCTA CCGTGAGGTG CCGACCGCCG GTCGGTGGAC CCAGCAGAAC GGCAAGCTGG TCAAGGTCAC CCTCGGTCCG GACGCGTTGG CGCTGCGCGG GTCGATGGTC GCCTACCAGG GCAGCGTCGA GTTCGACTAC AAGAGCAGCG GGCTGCGCGG TCTCATCGAG GGCAAGCTGA CCGGCCAGAG CCTCAAGCTG ATGACCTGCA AGGGCTCGGG TGAGGTCTTC CTTGCCCAGG ACGCCGCGGA CCTGCACATC GTCGAGCTCG GCAGCTCCTC CCTCTGCGTC AACGCGAAGA ACCTGCTGGC GATGGACGCC ACCGTGCGCA CCGAGGTGCG TCGGATCGAG AGCCCCGGCA TCCCCGGCGG GGGGTTCTTC CACTTCGAGG TCTCCGGGCC GGGTTCGGTG GTCGTCATGA CCAAGGGGTC TCCGATGACC CTCACCGTGC AGGGCCCGAC CTTCGCGGAC ATGAACGCCC TCGTCGCCTG GACCACCGGC ATGCGGGTCA GCGTGTCCAC TCAGGTCCGG ATCTCCCGGC AGATCTACGC CGGCGGTAGC GGCGAGGCAC TGGCTCTGCA GTTCATGGGG TTCGCGGGGC ACTTCATCGT CGTCCAGCCG TACGAGGTCT GA
|
Protein sequence | MLVTQFSRGQ KAQLSTITPG TDLYVGIQIN APGEWDISCF GLDGADRLTD DRYFVFFNQP ASPEKSIQLL GPQAGDSQAF RVTLDQVPAS IGRLSFCAAL DGAGSASAIS SGYLRVVVAG TEVLRYSFTG ADFTSERAVM IADIYRKGVW RIAAVGQGFQ GGLAELIRSY GGEVADEAPQ PAPAPGFGAP GFGAPLAAPP PMGFAGGMPA TGMPGGMPPT GAFGGPEILP SQARPTLPGA MNSLDPYREV PTAGRWTQQN GKLVKVTLGP DALALRGSMV AYQGSVEFDY KSSGLRGLIE GKLTGQSLKL MTCKGSGEVF LAQDAADLHI VELGSSSLCV NAKNLLAMDA TVRTEVRRIE SPGIPGGGFF HFEVSGPGSV VVMTKGSPMT LTVQGPTFAD MNALVAWTTG MRVSVSTQVR ISRQIYAGGS GEALALQFMG FAGHFIVVQP YEV
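
The sequences above are printed in space-separated blocks of ten characters, so they need to be joined before any computation. The following is a minimal sketch with hypothetical helper names. It shows how the blocks might be cleaned, how a GC percentage like the one in the GC content row could be computed, and how the first and last codons could be checked. The start codon set is a common bacterial subset rather than the full list allowed under translation table 11, which is a simplifying assumption.

```python
# Common bacterial start codons (a subset, assumed for illustration).
COMMON_STARTS = {"ATG", "GTG", "TTG"}
# Standard stop codons, which translation table 11 also uses.
STOPS = {"TAA", "TAG", "TGA"}


def clean_sequence(raw: str) -> str:
    """Remove all whitespace from a block-formatted sequence and uppercase it."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


def has_start_and_stop(cds: str) -> bool:
    """True if the CDS begins with a common start codon and ends with a stop codon."""
    return len(cds) >= 6 and cds[:3] in COMMON_STARTS and cds[-3:] in STOPS


# First two blocks of the gene sequence from this record.
fragment = clean_sequence("GTGTTGGTCA CCCAGTTCAG")
print(len(fragment))         # 20
print(gc_percent(fragment))  # 55.0

# First codon plus the last four codons of the gene sequence from this record.
print(has_start_and_stop(clean_sequence("GTG TACGAGGTCT GA")))  # True
```

Pasting the full gene sequence into `clean_sequence` and passing the result to `gc_percent` should give a value that can be compared against the reported 69%. The GTG start codon is read as methionine at the initiation position, which is why the protein sequence begins with M.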