Gene Tery_2076 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2076 
Symbol 
ID4245724 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3241214 
End bp3243199 
Gene Length1986 bp 
Protein Length661 aa 
Translation table11 
GC content34% 
IMG OID638107187 
Productprotein of unknown function DUF900, hydrolase-like 
Protein accessionYP_721790 
Protein GI113475729 
COG category[S] Function unknown 
COG ID[COG4782] Uncharacterized protein conserved in bacteria 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.445606 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGTTATATA ATTTCAGACT CAGAAAAGTC ATTCGGGAAT TACTTAATCA AGATCTACCC 
GAAGAAGAAT TTAACGACTT AGTTTATGAT TATTTTCCTG ATGTTTATAA CCAATTTACA
AATGGACAAA ATAAAAAGCA AAGAGTCAGA ATTTTAATTG AATATGCTGA CAAACATAGA
GAAATTGAGC GGTTACTTGA AGGCATTAAA AATATTAATC CAAAAGTTTA TCAAGAGTAT
GAGTCAAAAT TAGGAGAAAA TCCCCCTCCG CCTCCAATTG AAAAATGTGA TGTTTTGGTT
TTAGCAGCAA ACCCTACAAC TACACAGCCA CTACAATTAA AAAAAGAAAC TGAATTAATT
AGGGAAAAAC TACAGCAGAC AGAATTTGGA AAAAATTATA TTGTTTATGG AGAAGAAAAT
GCTTTTATAG AAGATTTATC TCAATATTTG CTGAAATATG AGCCTAGAAT TCTTCACTTT
AGCGGTCATG GTAATTCTCA AGGTGAAATA ATTTTAAACA ACCGTCAAGG TGAGGCAGAG
GTTTTATCCC TCGAAACATT ATCAGAATTA TTATCTATTG TTAGAAAAGA TGGAAAACCT
ATAGAATGTG TTGTATTTAA TGCCTGTTTT TCTCTGAAAA AAGCTGATGC AGTCGCTCAC
CAGGTAGGTT GTGTTATTGG CATGAAAAAA GAGATTGGTG ATGATTCTGC TTTGATATTT
GCCGAAGAAT TTTATCAAGG TTTAGCATAT CAAAGGAGCT ATTATCAAGC TTTTCAACTA
GGTATAAATG GAATTGAACG CTTAAGATTA CCTGATAGTC CAATTCCTCA TTTTATTCCT
TTTGATACAT CATTATTAGA GTCAGAAACT GTCAGTTTAA GAAGTCATCA AACCAACGGT
TATTTGACTT CAAAAGAAGC CGTAACAAAG AAAGCAATAA AGAAAAAAGA AACCGTAAAA
GTCAAAAGAT CTCTGATTTT AAAAGATACT AAAGAAACAA CAGCAACTAT ATATCCTTTA
TGGTTTGGTA CCAACAGAAA ACCTGTAGAT ACAAATAATA TATCCAAAGG TTTTTCCGGA
AAAAGAGATG ACAAACTTCA CTATGGTATT TGTCAAGTAG CTGTTCCTAA ATCTCATAAA
ATAGGCTCTA TAGGTTCCCC TTTGTGGAAA AGATTAATTA CTTTCAAAGA CGATCGCCTC
AAACTACATT TTCAAAGTTT GCAAATTCTG GAAAAAGAAC TATTTTGGGA AAATATCAAC
GAAGAATTAA AAGACCATGA AATAAATGAA AGGTCTGCTT TAGTCTTTGT TCATGGATAC
AACGTCAATT TTGAAGATGC AGCTATTAGA GCCGCACAAA TGGGGTTTGA CCTGCAAGTG
CCAGGAATTA CAGCCTTTTA TAGTTGGCCA TCTCAAGGGA AATTATCAGC ATATCCGGTA
GACGAGGCAA GTATTGAAGC CAGCGAAAAG TACATGACAG AATTTTTACT CAACCTAGCC
GAAAAAACGG ACATTGAGAA AATTCATATT ATTGCTCATA GTATGGGAAA CCGAGGTTTA
CTCAGAGCAG TCCAAAGAAT TATTTCTCAA GTTCAAACAA TAACTAATAT TGCTTTTGGG
CAAATTATTT TAGCCGCTCC AGATGTAGAT ATTGACTTGT TTAAAGAGTT AGCTAAAGGA
TATCATCAAT TAGCAGAACG AACTACATTA TACATATCAT CAAAAGACAA AGCCTTAGCA
ACTTCAGCGC TTATTCATCA GCATGGCCGA GCTGGTTTTT TCCCCCCTGT TACTGTTGTA
GAAGGAATAG ACACGGTAAA AGTTTCTAAG ATAGATTTAA CTTTATTAGG ACATGGTTAT
TTTGCTGATG CTCGTTTGGT ACTTGAAGAT ATACGGGACT TATTAATTAA TAATACTTCC
CCAGGGCAGC GAAGAGGTCG GTTAGAACCG TCGGAAGAGG GGGGTTATTG GATTATGCGG
CAGTAA
 
Protein sequence
MLYNFRLRKV IRELLNQDLP EEEFNDLVYD YFPDVYNQFT NGQNKKQRVR ILIEYADKHR 
EIERLLEGIK NINPKVYQEY ESKLGENPPP PPIEKCDVLV LAANPTTTQP LQLKKETELI
REKLQQTEFG KNYIVYGEEN AFIEDLSQYL LKYEPRILHF SGHGNSQGEI ILNNRQGEAE
VLSLETLSEL LSIVRKDGKP IECVVFNACF SLKKADAVAH QVGCVIGMKK EIGDDSALIF
AEEFYQGLAY QRSYYQAFQL GINGIERLRL PDSPIPHFIP FDTSLLESET VSLRSHQTNG
YLTSKEAVTK KAIKKKETVK VKRSLILKDT KETTATIYPL WFGTNRKPVD TNNISKGFSG
KRDDKLHYGI CQVAVPKSHK IGSIGSPLWK RLITFKDDRL KLHFQSLQIL EKELFWENIN
EELKDHEINE RSALVFVHGY NVNFEDAAIR AAQMGFDLQV PGITAFYSWP SQGKLSAYPV
DEASIEASEK YMTEFLLNLA EKTDIEKIHI IAHSMGNRGL LRAVQRIISQ VQTITNIAFG
QIILAAPDVD IDLFKELAKG YHQLAERTTL YISSKDKALA TSALIHQHGR AGFFPPVTVV
EGIDTVKVSK IDLTLLGHGY FADARLVLED IRDLLINNTS PGQRRGRLEP SEEGGYWIMR
Q