Gene Information |
Locus tag | Franean1_5902 |
Symbol | |
ID | 5674223 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 7169379 |
End bp | 7171256 |
Gene Length | 1878 bp |
Protein Length | 625 aa |
Translation table | 11 |
GC content | 74% |
IMG OID | 641244750 |
Product | cell envelope-related transcriptional attenuator |
Protein accession | YP_001510152 |
Protein GI | 158317644 |
COG category | [K] Transcription |
COG ID | [COG1316] Transcriptional regulator |
TIGRFAM ID | [TIGR00350] cell envelope-related function transcriptional attenuator common domain |
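The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene length includes the stop codon; the function names are hypothetical and are not part of any database API.

```python
# Minimal consistency check for the record's coordinate and length fields.
# Assumptions (not stated in the record): coordinates are 1-based and
# inclusive, and the CDS length counts the terminal stop codon.

def gene_length_bp(start_bp: int, end_bp: int) -> int:
    """Length of a feature given inclusive start and end coordinates."""
    return end_bp - start_bp + 1


def protein_length_aa(cds_length_bp: int) -> int:
    """Number of amino acids encoded by a CDS that ends in a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # the stop codon encodes no residue


if __name__ == "__main__":
    length = gene_length_bp(7169379, 7171256)
    print(length)                     # 1878
    print(protein_length_aa(length))  # 625
```

Both results agree with the Gene Length (1878 bp) and Protein Length (625 aa) fields listed in the record.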
Plasmid Coverage information |
Num covering plasmid clones | 18 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
Fosmid Coverage information |
Num covering fosmid clones | 12 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
Sequence |
Gene sequence | GTGACCGGCG CTGGACCGCG CGCCTCGACA GGGCCCCGAC AGTGGCCTGA GTCGCCCCGC ACCGGCTCCG ACCCCGCAGC GGGCGGGCGG CGCCGGCCGA GGTCACGGCC GCACGCACGC TCGCGCCAGC GCGAGGTCAC CCACTCGAAC CAGGACGGGT CGGAGACCGA GCTCACCGAG CCGAGACCGA ACCAGCGCCG CGGCGGCTGG GACAGGCTCC AGCGCCGGAA CCCGGGCGAG GCCGGCCCGG ACGCCCGAAC CCGGGAGCAG CACCTGGAGC AGAGTCAGGA CCACGCCCAG GACGGCGTCC ACGACGACAC CACCGACCAC GACCACAGCA CCGACCACGG CCACGGCCAC GGCCACGGCC ACGGGTCAGG GTGGCTCCCG GGGTCGGCCG GTCGGCGGCG TGGCGTGGCG TCGCGGCTCG CCGTCGTCGT CTCCGCCATC CTGTCGTGCC TGATCTTCGC CTTCGCCGTC GGCGGGTTCG CCGTCTACGA GCACTTCGAC CGCCAGATCA ACCGGCTGCG GCTGAGCCTG GACGGCGACC GGCCCGCAAG CCCCGTCGAG GGGACGACCA ACTTCCTGCT CGTCGGCTCG GACAGCCGGG CCGGCACCGG GGGCGAGTTC CAGCGCGGCG GCAAGGTCGC CGGCCAGCGT TCGGACACCA CGATCCTCGC CCACCTCGAC GCGAACGGGA CGACGACCCT GGTGTCGTTC CCGCGCGACA CCCTCGTGCG CATCCCCGGG CACGGCCGGG ACAAGCTGAC CCAGGCGATC TCCATCGGGG GCCCGGGGCT GCTGGTCCGG ACCATCGAGA ACCTCACCGA CATCCGTGTC GACCACTACG TGTCGGTCGA CCTCGCCGGG TTCCGCGAGA TGACCGACGC GATCGGCGGG GTCACGGTCT GTGTGAAGGC GCTGCCGGAC GGGCGGCGGA CGAACCTGCG TGACGAGTGG TCCCAGTGGC GGGGGCGGGT CGGCGAGAAC CACCTGACCG GCGACCAGGC CCTCGCGTTC GTCCGCCAGC GCCACGGCCT GCCCGACAAC GACTTCGACC GCATCCGCCG GCAGCAGCAG TTCATCGGGG CGGTCTTCCG CAAGGCCACC AGCGACGGCG TGCTGACGAG CCCGGCCCGG CTGGAGAACC TGATCAGCGC GGTGACGCGG GCGCTGACCA TCGACGACGG AACGGACATC GAGGATCTCC GGCTGCTCGC GAAGCGGATG GGGTCGATGA GCTCCGACCA GATCAGGTTC GTGACGATCC CCGTGCACGC GCCGTCGCCG GCCGAGGGCG GGAACGCGCT CGGCGAGCTG CCCCGGTTCG GTTCCGTGCA GCTTTACGAC CAGGCGCAGC TCGACGCGTT CCTGGCGCCG CTGCGCGGCC GGGACGGCAC CAGCCCGACG GCCGTCCCCG CCCCGCCGGC GTCGCCCCCG GGCGAGGTTT CCGTCGACGT GTTCAACGCC GCGCGGGTCG GGGGGCTCGC AGCGGCCGTG CGCAGTGACC TCGCCAGTCT CGGGTTCCGC GTCGGAACCC CGCGGGACTG GCCCGCCGGC TCGCTGCAGA CCAGCGAGGT GCGGTACGGG CCCGGCGGCG AGGCGGCGGC GCGCGCCGTG CGGGCCGTCG TGCCCGACGC CAGGCTTGTC CGCGACGACG ACCTGGCCGA CCGGATCTCC CTGGTGCTGG GCGAGTCGTT CGAGAAGGTG GACGCGACCG GCGTCCCCGC GGCCGGGGCC CGAGCGGTCT CCGGTATGCG CCCGTCGGCA CCGGGCAGCG CCTCGACGGG GTCCGCGGGC 
CCGGCCCTGT CCGGAGCCCC CACCAGGCCG ACCGCCCCGG TGACGGCGAC CGAGCTGACC ACCGGCTGCA CGTACTGA
Protein sequence | MTGAGPRAST GPRQWPESPR TGSDPAAGGR RRPRSRPHAR SRQREVTHSN QDGSETELTE PRPNQRRGGW DRLQRRNPGE AGPDARTREQ HLEQSQDHAQ DGVHDDTTDH DHSTDHGHGH GHGHGSGWLP GSAGRRRGVA SRLAVVVSAI LSCLIFAFAV GGFAVYEHFD RQINRLRLSL DGDRPASPVE GTTNFLLVGS DSRAGTGGEF QRGGKVAGQR SDTTILAHLD ANGTTTLVSF PRDTLVRIPG HGRDKLTQAI SIGGPGLLVR TIENLTDIRV DHYVSVDLAG FREMTDAIGG VTVCVKALPD GRRTNLRDEW SQWRGRVGEN HLTGDQALAF VRQRHGLPDN DFDRIRRQQQ FIGAVFRKAT SDGVLTSPAR LENLISAVTR ALTIDDGTDI EDLRLLAKRM GSMSSDQIRF VTIPVHAPSP AEGGNALGEL PRFGSVQLYD QAQLDAFLAP LRGRDGTSPT AVPAPPASPP GEVSVDVFNA ARVGGLAAAV RSDLASLGFR VGTPRDWPAG SLQTSEVRYG PGGEAAARAV RAVVPDARLV RDDDLADRIS LVLGESFEKV DATGVPAAGA RAVSGMRPSA PGSASTGSAG PALSGAPTRP TAPVTATELT TGCTY
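The relationship between the gene sequence and the protein sequence follows from translation table 11 (the bacterial, archaeal and plant plastid code). The sketch below is a minimal, standard-library illustration rather than the database's own pipeline. It assumes the sequence has had its spaces removed, and that an alternative start codon such as GTG is read as methionine when it begins the CDS. The helper name `translate_table11` is hypothetical.

```python
# Minimal translation sketch for NCBI translation table 11.
# Table 11 uses the same codon-to-amino-acid assignments as the standard
# code; it differs in the set of permitted start codons.

BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: aa
    for (a, b, c), aa in zip(
        ((x, y, z) for x in BASES for y in BASES for z in BASES),
        AMINO_ACIDS,
    )
}
# Start codons listed for table 11; each is read as Met at the CDS start.
START_CODONS = {"TTG", "CTG", "ATT", "ATC", "ATA", "ATG", "GTG"}


def translate_table11(dna: str, is_cds_start: bool = True) -> str:
    """Translate a DNA string codon by codon, stopping at the first stop."""
    dna = dna.replace(" ", "").upper()
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        codon = dna[i:i + 3]
        if i == 0 and is_cds_start and codon in START_CODONS:
            protein.append("M")  # initiator codon is read as Met
            continue
        residue = CODON_TABLE[codon]
        if residue == "*":  # stop codon ends translation
            break
        protein.append(residue)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 nt of the gene sequence in this record.
    print(translate_table11("GTGACCGGCG CTGGACCGCG CGCCTCGACA"))  # MTGAGPRAST
```

The first ten residues produced from the opening 30 nucleotides match the start of the listed protein sequence, including the GTG start codon being read as M.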