Gene Tery_1857 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1857 
Symbol 
ID4242720 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2841315 
End bp2842685 
Gene Length1371 bp 
Protein Length456 aa 
Translation table11 
GC content36% 
IMG OID638106978 
Productcell envelope-related transcriptional attenuator 
Protein accessionYP_721586 
Protein GI113475525 
COG category[K] Transcription 
COG ID[COG1316] Transcriptional regulator 
TIGRFAM ID[TIGR00350] cell envelope-related function transcriptional attenuator common domain 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.046941 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAATAATA GTTCTTCTAA TAACTCCCAG CCTATTTTCT TGAAATTGTG GTTCTGGATA 
CTTATATTAC TGTTAAGTGG AACCATAACT ACTTCTTTGG GAGCAATAAC TGCTTTGTTT
GCCCCTGTTG AACCAGAATT TATAACTCAG CTACTCCAAG AGTTTATTCC AGATATTATT
GTTTGGCGTA AACGCTTACC TTACCAACTT TCACGACCTA TGAATATCCT AGTGATGGGA
ATTGATGAAG TTCCTGGGGT AACAGAAGAT TCTCCTAATG TATTTGAAGG CCGTAGTGAT
ACTTTACTAT TAGTACGAGC TAATCCTAGG AAGAAGACTG TAAGTTTACT TTCTGTACCA
CGAGATACTA AAGTCCAAAT TCCTGGAATA GGTCTTGCTA AAATTAATGA AGCTAATCTA
TATGGGGGAC CAAAACTGGC AAAAAATATT TTAAAAAATA CTCTGAATAA TGTTGAGGTT
GATCGTTATG TGCGAGTAAG CAAAAAGGGG TTTCGAGAAT TAGTAGAACA ATTGGGTGGG
GTAGAAGTTT TTATTTCTCA ACCTATGTTT TATATTGACA ATACTCAGCA GTTAAAAATA
GATTTACAGC CAGGTTGGCA AACTCTCAAT GGAGAACAAG CGCAGCAATT TGCTCGTTTC
CGAGACGAAG TTTATGGCGA TATTGGGAGG GTGCAAAGAC AACAAGTTTT GCTAAAGGCC
CTGCGAAGTC GTATTACTAA TGTGACAGTT TTACCTCGCC TACCTCAAAT TATTCGGGTG
ATGCAGAAGT ATGTTGATAC TAATTTAAGT TTTCCGGAAA TGCTGGCTTT GATCAGTTTT
AGTCTTGACT TGGAGCGAGA TGATTTAAAA ATGGTAATGT TGCCAGGAAG ACCTAGTTCT
AGAAATGAGT ATTTTGCTAG TTATTGGATT ATAGATCAGG CGGGTCGCGA TCGTGTCCTA
GAACAATATT TCGATTTCAA GTTAGACAAG TTTAATTATG ACAATATATA TAAAAAATAT
AAAAATGAAA TTTTGCCACA AGAAAAGAAA ATAGCCGTGC AAAATGCCTC TAGTCACCCT
CAAGTAGCAA CAAAATTTGC TCAGTATTTG CGTAACGAAG GATTTGACAA TGTTTATGTA
GTTCCTGACT GGTCTGACAA ACAACATTTG ACTCAAGTTA TTGTTCAAGG GGGTTATTTA
GATTTAGCAG AAGTTTTGCA AAATACTTTG GGTTTTGGTA ATATTGAACC AACTTCTACA
GGTGAGATAG GTTCTGATTT GACCATTAGA TTAGGGGAAG ATTCTATTAA TAAAATTAAT
CTTTTAGAAA AAAATGAAGT TGAAGCCAAA ATTGATTCAT CAGACCTTTA A
 
Protein sequence
MNNSSSNNSQ PIFLKLWFWI LILLLSGTIT TSLGAITALF APVEPEFITQ LLQEFIPDII 
VWRKRLPYQL SRPMNILVMG IDEVPGVTED SPNVFEGRSD TLLLVRANPR KKTVSLLSVP
RDTKVQIPGI GLAKINEANL YGGPKLAKNI LKNTLNNVEV DRYVRVSKKG FRELVEQLGG
VEVFISQPMF YIDNTQQLKI DLQPGWQTLN GEQAQQFARF RDEVYGDIGR VQRQQVLLKA
LRSRITNVTV LPRLPQIIRV MQKYVDTNLS FPEMLALISF SLDLERDDLK MVMLPGRPSS
RNEYFASYWI IDQAGRDRVL EQYFDFKLDK FNYDNIYKKY KNEILPQEKK IAVQNASSHP
QVATKFAQYL RNEGFDNVYV VPDWSDKQHL TQVIVQGGYL DLAEVLQNTL GFGNIEPTST
GEIGSDLTIR LGEDSINKIN LLEKNEVEAK IDSSDL