Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_1857 |
Symbol | |
ID | 4242720 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | + |
Start bp | 2841315 |
End bp | 2842685 |
Gene Length | 1371 bp |
Protein Length | 456 aa |
Translation table | 11 |
GC content | 36% |
IMG OID | 638106978 |
Product | cell envelope-related transcriptional attenuator |
Protein accession | YP_721586 |
Protein GI | 113475525 |
COG category | [K] Transcription |
COG ID | [COG1316] Transcriptional regulator |
TIGRFAM ID | [TIGR00350] cell envelope-related function transcriptional attenuator common domain |
|
|
Plasmid Coverage information |
Num covering plasmid clones | 8 |
Plasmid unclonability p-value | 0.046941 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 10 |
Fosmid unclonability p-value | 1 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAATAATA GTTCTTCTAA TAACTCCCAG CCTATTTTCT TGAAATTGTG GTTCTGGATA CTTATATTAC TGTTAAGTGG AACCATAACT ACTTCTTTGG GAGCAATAAC TGCTTTGTTT GCCCCTGTTG AACCAGAATT TATAACTCAG CTACTCCAAG AGTTTATTCC AGATATTATT GTTTGGCGTA AACGCTTACC TTACCAACTT TCACGACCTA TGAATATCCT AGTGATGGGA ATTGATGAAG TTCCTGGGGT AACAGAAGAT TCTCCTAATG TATTTGAAGG CCGTAGTGAT ACTTTACTAT TAGTACGAGC TAATCCTAGG AAGAAGACTG TAAGTTTACT TTCTGTACCA CGAGATACTA AAGTCCAAAT TCCTGGAATA GGTCTTGCTA AAATTAATGA AGCTAATCTA TATGGGGGAC CAAAACTGGC AAAAAATATT TTAAAAAATA CTCTGAATAA TGTTGAGGTT GATCGTTATG TGCGAGTAAG CAAAAAGGGG TTTCGAGAAT TAGTAGAACA ATTGGGTGGG GTAGAAGTTT TTATTTCTCA ACCTATGTTT TATATTGACA ATACTCAGCA GTTAAAAATA GATTTACAGC CAGGTTGGCA AACTCTCAAT GGAGAACAAG CGCAGCAATT TGCTCGTTTC CGAGACGAAG TTTATGGCGA TATTGGGAGG GTGCAAAGAC AACAAGTTTT GCTAAAGGCC CTGCGAAGTC GTATTACTAA TGTGACAGTT TTACCTCGCC TACCTCAAAT TATTCGGGTG ATGCAGAAGT ATGTTGATAC TAATTTAAGT TTTCCGGAAA TGCTGGCTTT GATCAGTTTT AGTCTTGACT TGGAGCGAGA TGATTTAAAA ATGGTAATGT TGCCAGGAAG ACCTAGTTCT AGAAATGAGT ATTTTGCTAG TTATTGGATT ATAGATCAGG CGGGTCGCGA TCGTGTCCTA GAACAATATT TCGATTTCAA GTTAGACAAG TTTAATTATG ACAATATATA TAAAAAATAT AAAAATGAAA TTTTGCCACA AGAAAAGAAA ATAGCCGTGC AAAATGCCTC TAGTCACCCT CAAGTAGCAA CAAAATTTGC TCAGTATTTG CGTAACGAAG GATTTGACAA TGTTTATGTA GTTCCTGACT GGTCTGACAA ACAACATTTG ACTCAAGTTA TTGTTCAAGG GGGTTATTTA GATTTAGCAG AAGTTTTGCA AAATACTTTG GGTTTTGGTA ATATTGAACC AACTTCTACA GGTGAGATAG GTTCTGATTT GACCATTAGA TTAGGGGAAG ATTCTATTAA TAAAATTAAT CTTTTAGAAA AAAATGAAGT TGAAGCCAAA ATTGATTCAT CAGACCTTTA A
|
Protein sequence | MNNSSSNNSQ PIFLKLWFWI LILLLSGTIT TSLGAITALF APVEPEFITQ LLQEFIPDII VWRKRLPYQL SRPMNILVMG IDEVPGVTED SPNVFEGRSD TLLLVRANPR KKTVSLLSVP RDTKVQIPGI GLAKINEANL YGGPKLAKNI LKNTLNNVEV DRYVRVSKKG FRELVEQLGG VEVFISQPMF YIDNTQQLKI DLQPGWQTLN GEQAQQFARF RDEVYGDIGR VQRQQVLLKA LRSRITNVTV LPRLPQIIRV MQKYVDTNLS FPEMLALISF SLDLERDDLK MVMLPGRPSS RNEYFASYWI IDQAGRDRVL EQYFDFKLDK FNYDNIYKKY KNEILPQEKK IAVQNASSHP QVATKFAQYL RNEGFDNVYV VPDWSDKQHL TQVIVQGGYL DLAEVLQNTL GFGNIEPTST GEIGSDLTIR LGEDSINKIN LLEKNEVEAK IDSSDL
|
| |