Gene Tery_2451 details

Gene Information

Locus tag: Tery_2451
Symbol:
ID: 4244635
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 3775803
End bp: 3776918
Gene Length: 1116 bp
Protein Length: 371 aa
Translation table: 11
GC content: 39%
IMG OID: 638107536
Product: taurine catabolism dioxygenase TauD/TfdA
Protein accession: YP_722136
Protein GI: 113476075
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2175] Probable taurine catabolism dioxygenase
TIGRFAM ID:
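
The coordinate and length fields in this record can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming the Start and End positions are 1-based and inclusive and that the final codon is a stop codon not counted in the protein length; the record does not state either convention, and the function names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
START_BP = 3775803
END_BP = 3776918
GENE_LENGTH_BP = 1116
PROTEIN_LENGTH_AA = 371


def span_length(start: int, end: int) -> int:
    """Feature length for 1-based inclusive coordinates (assumed convention)."""
    return end - start + 1


def expected_protein_length(gene_length_bp: int) -> int:
    """Number of codons minus one, assuming a single terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


# Compare the derived values with the ones listed in the record.
print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
print(expected_protein_length(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both comparisons hold: the span covers 1116 bp, which is 372 codons, or 371 amino acids plus a stop codon.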


Plasmid Coverage information

Num covering plasmid clones: 24
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAGAGTA CAGCAACTTT AAATAAGACT TTTCTTACAG TCAAAGGCAA ACGTTTTCAC 
TATGTTTGGT TGCGGGATAA TTGCTTATCT CCAAAATCTC GTCATCCAAC TTCTTTTCAG
AAACTATATG AAATGAAGCA TACTTTATGC CCTGAACCAT TATCTGTAGA AGAGAAAGAT
GGGGAACTGA CTATCATTTG GAACGAAGAT CCTCCTCACA AAAGTACATT CTCAATATCT
TGGTTGTTGA GTCACGCTTA CGATGACGGA GAGCAGGATA ATGAAAATGA AGACTCTGAA
TCTAATTCTC AGAACCAGGA ATTTTTATGG GATAAAGCTT GGATAGAAGC AAATATATCA
AAGTTGCAGG AAGCTCTTTC ATCCAATCCC GAATTGTGGC TCGAGCAGCT TTTTACTTTC
GGATTCACCG TACTCCATAA TATACCAGCC AAGGATTTGC AGGCTACAAT AGAATCCATT
GGGCCAATTT ACAACGGTGA CTACGGATTG TTCGCACCAT CAAAGACTAC AAATGAAGGA
AAGGATTTAG CAGAGACTGG TAATGCGATG AGCTTCCATA CTGACTATAC TTATTGGCAC
ACTCCACCTT TGCTCACCAG CTTGTATTGT GTAGAAAATA GTGCTTCTGG TGGGGAGTCG
CTGATCGTAG ACGGTTTTCG AGTTGTGGAT GATTTTCGTC AGCAACATCC GGATTATTTT
CAGATTTTGA CTCAAACTCC CATCCAGTTC AAACAGGTGT ACACAAAATG GCAATACTTT
TATTCCAGAA CTCAGCCTAT ATTAGAATTA GATGAATATG GAAAAGTGAC TAGAATCAAT
TTTGCCAACT CTCACAGCTA CACTTGGAAA TTACCATTCG ATCAAATGGA GGAATTTTAT
GCAGCTTACA TAACCTTCTT TCAATATGTA AAAAATCCAG TTTATGAATA CTGCTTTAGT
CTAGAACCGG GAGACCTTCT GTTGATGAAT GACTCCAGGA TTATGCATGG ACGGAAAGCT
TTTACAGGTA ATCGACATCT GGAAATAGCC TGTGTTTCTT GGGATTTTTT GCAGGCACGG
CAGCGCTTTC ATCAAAATAA ACATCTTTTT ATGTAG
 
Protein sequence
MKSTATLNKT FLTVKGKRFH YVWLRDNCLS PKSRHPTSFQ KLYEMKHTLC PEPLSVEEKD 
GELTIIWNED PPHKSTFSIS WLLSHAYDDG EQDNENEDSE SNSQNQEFLW DKAWIEANIS
KLQEALSSNP ELWLEQLFTF GFTVLHNIPA KDLQATIESI GPIYNGDYGL FAPSKTTNEG
KDLAETGNAM SFHTDYTYWH TPPLLTSLYC VENSASGGES LIVDGFRVVD DFRQQHPDYF
QILTQTPIQF KQVYTKWQYF YSRTQPILEL DEYGKVTRIN FANSHSYTWK LPFDQMEEFY
AAYITFFQYV KNPVYEYCFS LEPGDLLLMN DSRIMHGRKA FTGNRHLEIA CVSWDFLQAR
QRFHQNKHLF M
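
The sequences above are displayed in space-separated blocks of ten characters, so they need to be joined before any length or composition check. The following is a minimal sketch of that cleanup plus a GC-fraction calculation; the helper names `clean_sequence` and `gc_fraction` are hypothetical and not part of any database tooling.

```python
def clean_sequence(text: str) -> str:
    """Remove all whitespace (block spacing and line breaks) and upper-case."""
    return "".join(text.split()).upper()


def gc_fraction(sequence: str) -> float:
    """Fraction of G and C bases in a nucleotide sequence."""
    seq = clean_sequence(sequence)
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


# Last line of the gene sequence above, used here as a small sample.
sample = "CAGCGCTTTC ATCAAAATAA ACATCTTTTT ATGTAG"
print(len(clean_sequence(sample)))
print(round(gc_fraction(sample) * 100))
```

Applied to the full gene sequence, the cleaned length should match the Gene Length field, and the rounded GC percentage can be compared with the GC content field.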