Gene Information |
Locus tag | Sare_4011 |
Symbol | |
ID | 5707433 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Salinispora arenicola CNS-205 |
Kingdom | Bacteria |
Replicon accession | NC_009953 |
Strand | - |
Start bp | 4562720 |
End bp | 4563862 |
Gene Length | 1143 bp |
Protein Length | 380 aa |
Translation table | 11 |
GC content | 65% |
IMG OID | 641273436 |
Product | cell envelope-related transcriptional attenuator |
Protein accession | YP_001538792 |
Protein GI | 159039539 |
COG category | [K] Transcription |
COG ID | [COG1316] Transcriptional regulator |
TIGRFAM ID | [TIGR00350] cell envelope-related function transcriptional attenuator common domain |
|
|
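The coordinate and length fields in the table above can be cross-checked with a little arithmetic. The following is a minimal sketch, assuming the Start bp and End bp values are 1-based inclusive positions and that the CDS ends with a single stop codon; the function names `span_length` and `coding_codons` are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
start_bp = 4562720
end_bp = 4563862
gene_length_bp = 1143
protein_length_aa = 380


def span_length(start, end):
    """Length of a closed interval [start, end] (assumed 1-based, inclusive)."""
    return end - start + 1


def coding_codons(length_bp):
    """Number of amino-acid codons, assuming exactly one terminal stop codon."""
    if length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return length_bp // 3 - 1


print(span_length(start_bp, end_bp))   # 1143
print(coding_codons(gene_length_bp))   # 380
```

Under these assumptions both results agree with the reported Gene Length (1143 bp) and Protein Length (380 aa).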
Plasmid Coverage information |
Num covering plasmid clones | 21 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 4 |
Fosmid unclonability p-value | 0.000201414 |
Fosmid Hitchhiker | Yes |
Fosmid clonability | hitchhiker |
| |
Sequence |
Gene sequence | ATGACTGTGG GTAAGGCCGG CAAAGGCGGC AAGAAGCGGC CGTCTGTGTG GGCGGGCGTG CCACGGTGGG CCCAGGTGTG CACCGTCTTC GGTGCTGTGC TGATGTTCGT CAGCGGGGCG GCTCTGGTCG GGGCCGAGGC GCTGATGGCC CGGTACGAGG GTGCGGTGGG TAAGGCGGAC CTGTTCGGGG ACCAGGCGGC AGGCGCCAGT GAGCGCACGA GCGACATCAA GGGACCGCTC AGCATCCTGC TGGTCGGTGT TGATCCCCGG AAGCCGGAAC AGCCGCCGTT GGCCGACTCG ATCATGGTGC TGCACGTGCC GGAGGGCCTC GACCGGGCGT ACCTCTTCTC AATGCCCCGT GATCTCTACG TTGACATTCC CGCCTTCGAG AAGGCCGGGT TCCCTGGCGG CCAGGACAAG CTCAACGCCG CGATGGCCTA CGGCAGCCGT CAGCAGGGGG AGAACCCGAG CTCGGCGCAG GGTTTCGAGC TGCTCGCGAA GACGGTGCAG TCGTTGACCG GCATCAAGCG GTTCGACGCC GGCGCGATCA TCAATTTCGG TGGGTTCATC AAGATCGTGG ACGCGATGGG CGGTGTCACG ATGGACATCG AGCGCGAGGT GCGCTCAGAG CATCGTCGTC CCGACGGCAC CCATCGTGAG CTGCGCCCCG GCGGCGGGGG ATACCTTGGT GAGCAGGCGG TCTACCCGGA AGGTGAACAG CTCCTCGAGG GTTGGCAGGC GCTGGACTAT GTCCGTCAGC GCTACCCGGC GAACGGCGTG CCGGATGGCG ACTACGGTCG CCAGCGCCAC CAGCAGCAGT TCGTCAAGGC AATGGCGAGT CAGGCGTTGA GCGCCGACGT GGTGACCAAT CCGATCAAGC TCGACCGGGT ACTCCGGGCC GCTGGCGAGT CACTGGTGTT CAACGGCCGG GGGCACAGTG TGATTGACTT TGGTATCGCC CTCAAGGACC TCCGACCGGG CAACATCCAG ATGATTAAGT TGCCGGGTGG CGGGATCACG GCTAATGGCA AGTACCAGGG CGAGCGTTTC GAGCCGGCCG TACAGGACTT CTTCCGGGCG TTGAGAGACG AGCAGCTCGA CGCCTTCCTG CTGGAGCACC CGGACTTTCA GAACAAGGGC TAA
|
Protein sequence | MTVGKAGKGG KKRPSVWAGV PRWAQVCTVF GAVLMFVSGA ALVGAEALMA RYEGAVGKAD LFGDQAAGAS ERTSDIKGPL SILLVGVDPR KPEQPPLADS IMVLHVPEGL DRAYLFSMPR DLYVDIPAFE KAGFPGGQDK LNAAMAYGSR QQGENPSSAQ GFELLAKTVQ SLTGIKRFDA GAIINFGGFI KIVDAMGGVT MDIEREVRSE HRRPDGTHRE LRPGGGGYLG EQAVYPEGEQ LLEGWQALDY VRQRYPANGV PDGDYGRQRH QQQFVKAMAS QALSADVVTN PIKLDRVLRA AGESLVFNGR GHSVIDFGIA LKDLRPGNIQ MIKLPGGGIT ANGKYQGERF EPAVQDFFRA LRDEQLDAFL LEHPDFQNKG
|
| |
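The gene sequence above is listed in coding orientation (it begins with ATG and ends with TAA), so it can be translated directly and compared with the protein sequence. The sketch below is one possible way to do that in plain Python; it assumes the amino-acid assignments of translation table 11 match the standard code (table 11 differs mainly in which codons may act as start codons, which does not matter here because the gene starts with ATG). The helper names `clean`, `translate`, and `gc_fraction` are hypothetical.

```python
# Codon table built in the conventional T, C, A, G base order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"   # codons starting with T
    "LLLLPPPPHHQQRRRR"   # codons starting with C
    "IIIMTTTTNNKKSSRR"   # codons starting with A
    "VVVVAAAADDEEGGGG"   # codons starting with G
)
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean(seq):
    """Remove the spaces between 10-letter blocks and upper-case the sequence."""
    return "".join(seq.split()).upper()


def translate(dna):
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for pos in range(0, len(dna) - len(dna) % 3, 3):
        residue = CODON_TABLE[dna[pos:pos + 3]]
        if residue == "*":
            break
        protein.append(residue)
    return "".join(protein)


def gc_fraction(dna):
    """Fraction of G and C bases in the sequence."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


# First three 10-base blocks of the gene sequence listed above.
prefix = "ATGACTGTGG GTAAGGCCGG CAAAGGCGGC"
print(translate(prefix))  # MTVGKAGKGG
```

The output matches the first block of the listed protein sequence. Passing the full gene sequence to `translate` and `gc_fraction` would allow the same comparison against the whole protein and the reported GC content.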