Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Franean1_5205 |
Symbol | |
ID | 5673539 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Frankia sp. EAN1pec |
Kingdom | Bacteria |
Replicon accession | NC_009921 |
Strand | + |
Start bp | 6249412 |
End bp | 6251094 |
Gene Length | 1683 bp |
Protein Length | 560 aa |
Translation table | 11 |
GC content | 71% |
IMG OID | 641244059 |
Product | cell envelope-related transcriptional attenuator |
Protein accession | YP_001509469 |
Protein GI | 158316961 |
COG category | [K] Transcription |
COG ID | [COG1316] Transcriptional regulator |
TIGRFAM ID | [TIGR00350] cell envelope-related function transcriptional attenuator common domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 0.0714783 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 16 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGCGGGACG CGCGCGCACG GCAGTCGTGC GGCGCCCCCG AAGTGACGCC GGCAGAGACC GCGGACGGTG AGCACGACAC CCGCGAACGC GACATCGGCG CCCGCGGCGC GGAACTCGAC GGGCTAAAGA TCGGCGGCCC GGACGTCAGC GGCGACGACA TCGGCGGCGA GACCGAAGGC GGCAGCAAGG CCGGCCCGCC GGGTAACGGG AGCGGCGCGG CGGCCGGCGC CGCGCGGCGC GGCCCGATAC GCCGGGTTCT GCTCGTCCTG ACCGCCCTGC TGTCGGTCGC CGTGGTGGTG GTGACCACCA CCGGCTGGTT CGTCATCACC TTCTACGACC GCAGGATCGA TCGCGAGACC ATCGCGCCGC CTGCCGACAT CACGGTGACC CGCCCACCGC CCGCGCCGGT CGGCACCGAG ACCTGGCTCC TCGTCGGCTC CGACGTGCGC ACCGGCTCGG ACGCCGCGGC GGTCAGTGGC GCGCGATCCG ACACTATGAT GATCGCCCAC CTGGCCTCGG ACGGGCGGAC GAACATCGTG TCGGTCCCCC GTGACCTGAG GGTGCCCATC CCGGCCTGGA CCGACGACGA CGGCACCCAC CACCGGGCCC GCCGAGACAA GATCAACGCA GCGTTCGGCA GCGGTGGCCC CGCGCTCCTC GTCGCCACCC TCGAGCAGGT GGCGGGACTG CGCATCAACC ACTACGCCGA ACTCGACTTC AACGGCTTCC AGCAGATGAC GTCCGCGATC GGTGGCATCG ACGTGTGTCT GCAGGCATCG AGCTACGTCG AGCCGCACAC ACTGGAGAAC GGCCGGCGGG TGCGGTCGAT GAACCTGAAC GACCCCAGCT CCGGTTTCCT CGGGCAGCCG GGAAACAATC ATCTGATCGG CGGCAATGCG CTTGCCTTTG TTCGGCAACG ACATGGTTTC GCTGACGGCG ACCTCTCTCG GATCCGCCGC CAGCAGGCGT TTCTCGCGGC GATGTTCCGA AAGGTCAGCA GCAGCGACGT CCTCTTGCGC CCGACCAAGC TCGCCGCGTT TCTGGGCGCG GTGACGCGGT CGGTGGTGCT GGACGACGAG ACCGGCTTCA CCGAGCTGCG CGCGCTGGCC GAGCGGATGC GCGGGATGAC GACCGGCGCC GTCACGTTCT CCACCGTCCC GATCACGGGC CAGATCGCCG AACCGGCCTT CTACTTCCTG TATGACCCCG ACCAGATGCG GCAGTTCTTC CGGAACATCA CCGGCGGCGA GTCCCTGCCC GAGCCCACCG GCTCCGGAGA CCTCATCCCG CTCGGCGGCG CCTTCACCCC GGAACCCTCG ATCGGCGCAC CGACCGCCCC CACCGCGGCA GTCGCCCTCC CACCCACCGA GAGCGCCACC CCAACAGTGA CGCCTCAGGT ACCCGACGTG GCATCGACGC CCGCCTCCCC GGCCGAACAG CCAACAGTCA CCCCCACGCC CCCGCCTACG GCGATACCCA CGGCCACGGC GCCGACCGGT GTCAGTGTCG GCGCCGGCGT CGACATCCTG GCCCGGCCGG TCGCCGGGAC CGGGCAGGAC GCCACGGCCG GCACCACCCC GGGCACACTT GGGCCGTCCG CCACGCTGCC GGTGGGGCCG TCGGTGGGAT CGTCCGCGAC GACCGAGCCG CCGGTGACGG CCGCCGCCGC CTGCATCTAC TGA
|
Protein sequence | MRDARARQSC GAPEVTPAET ADGEHDTRER DIGARGAELD GLKIGGPDVS GDDIGGETEG GSKAGPPGNG SGAAAGAARR GPIRRVLLVL TALLSVAVVV VTTTGWFVIT FYDRRIDRET IAPPADITVT RPPPAPVGTE TWLLVGSDVR TGSDAAAVSG ARSDTMMIAH LASDGRTNIV SVPRDLRVPI PAWTDDDGTH HRARRDKINA AFGSGGPALL VATLEQVAGL RINHYAELDF NGFQQMTSAI GGIDVCLQAS SYVEPHTLEN GRRVRSMNLN DPSSGFLGQP GNNHLIGGNA LAFVRQRHGF ADGDLSRIRR QQAFLAAMFR KVSSSDVLLR PTKLAAFLGA VTRSVVLDDE TGFTELRALA ERMRGMTTGA VTFSTVPITG QIAEPAFYFL YDPDQMRQFF RNITGGESLP EPTGSGDLIP LGGAFTPEPS IGAPTAPTAA VALPPTESAT PTVTPQVPDV ASTPASPAEQ PTVTPTPPPT AIPTATAPTG VSVGAGVDIL ARPVAGTGQD ATAGTTPGTL GPSATLPVGP SVGSSATTEP PVTAAAACIY
|
| |