Gene Tery_1438 details

Gene Information

Locus tag: Tery_1438
Symbol:
ID: 4243032
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 2174746
End bp: 2175996
Gene Length: 1251 bp
Protein Length: 416 aa
Translation table: 11
GC content: 43%
IMG OID: 638106596
Product: aluminium resistance
Protein accession: YP_721206
Protein GI: 113475145
COG category: [P] Inorganic ion transport and metabolism
COG ID: [COG4100] Cystathionine beta-lyase family protein involved in aluminum resistance
TIGRFAM ID:


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value: 0.406452
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAATAGTA TTGAATATCT ACAAGCTGGA GAACAAGAGC TATATCCCAT ATTTTCGGAA 
ATTGATAGCT TGGTTAAGCA AAATCTCAGG GGGGTAATAG ATGCTTTTCG TCATCATCGA
GTAGGTGTCC ATCATTTTTC TGGAGTTACT GGTTATGGCC ATGACGATAT TGGCCGGGAG
ACTTTGGACA AGGTTTTTGC AGAGATTATG GGGGCTGAGG CAGCAGCAGT AAGAATCCAA
TTTGTTTCTG GAACTCATGC GATCGCCTGT GCTTTATTTG GGACTCTAAG GCCAGGGGAT
GAAATGTTGT CTGTTGTGGG CCCTCCTTAT GATACTTTAG AGGAAGTAAT TGGTCTCAGA
GAAGAAGGCC AAGGTTCTCT GAAAGAATTT GGTATTAGTT ACCGAGAATT ACCCCTGACC
GAAACTGGGA CAATAAATTG GCAAGGGTTA GAATCTGCTG TTACTGAAAA AACTCGTCTG
GCTTTAATTC AACGTTCTTG TGGTTATTCT TGGCGAGCTA GTTTAGCTAT TTCAGAGATT
GAGAAAATTG TCAAGATAGT TAAGCAAAAA AATTCTGAAA CGGTTTGTTT TGTAGACAAC
TGTTATGGAG AGTTTATTGA AGACCGGGAA CCGATCGCAG TGGGAGCAGA TTTAGTAGCG
GGGTCTTTAA TTAAAAATCC TGGTGGCACA ATTGTTATGG CGGGGGGCTA TGTAGCAGGT
AGAGCAGATT TGGTGGAAGC TGCAACTTGT CGGTTAACTG CTCCTGGTAT TGGTAGTAGT
GGGGGAGCAA CTTTTGAGCA AAATAGATTA TTGTTTCAAG GTTTATTTTT GGCACCACAA
ATGGTGGGGG AAGCGATGAA GGGAAATCAT TTGACGGCTT ATGTTTTTGA TAAGTTGGGT
TATGCTGTTA ATCCTCTTCC TTTTGAAAAA CGACGGGATG TTATTCAGGG GATTAAGTTG
GGTTCCCCGG AAAAGTTAAT TGCCTTTTGT CGAGCTATTC AGCAATATTC CCCTGTAGGG
TCTTATTTAG ATCCGGTACC GGGGCCGATG CCTGGATATG ACAGTCAGTT GGTTATGGCA
GGTGGAACTT TTATTGATGG GTCTACTTCT GAATTTTCGG CAGATGGACC TTTGAGGGAA
CCTTATGTGG TTTTTTGTCA GGGGGGGACT CATTGGACTC ATGTTGCGAT CGCTCTGGAG
GCGGCTATTA AGGCAGTAGG GAGTAGTAGA GATATTCTGC AACATCCCTA A
 
Protein sequence
MNSIEYLQAG EQELYPIFSE IDSLVKQNLR GVIDAFRHHR VGVHHFSGVT GYGHDDIGRE 
TLDKVFAEIM GAEAAAVRIQ FVSGTHAIAC ALFGTLRPGD EMLSVVGPPY DTLEEVIGLR
EEGQGSLKEF GISYRELPLT ETGTINWQGL ESAVTEKTRL ALIQRSCGYS WRASLAISEI
EKIVKIVKQK NSETVCFVDN CYGEFIEDRE PIAVGADLVA GSLIKNPGGT IVMAGGYVAG
RADLVEAATC RLTAPGIGSS GGATFEQNRL LFQGLFLAPQ MVGEAMKGNH LTAYVFDKLG
YAVNPLPFEK RRDVIQGIKL GSPEKLIAFC RAIQQYSPVG SYLDPVPGPM PGYDSQLVMA
GGTFIDGSTS EFSADGPLRE PYVVFCQGGT HWTHVAIALE AAIKAVGSSR DILQHP
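
Consistency check (illustrative)

The fields above can be cross-checked against one another: the coordinates should span the stated gene length, the gene length should give the protein length once the stop codon is subtracted, and the nucleotide sequence should translate to the listed protein. The Python sketch below shows one way to do this. The function names (clean_sequence, gene_length, gc_fraction, translate) are hypothetical helpers written for this example and are not part of any database tool. It assumes the sequence contains only the unambiguous bases A, C, G and T, and it uses the standard codon assignments, which translation table 11 shares for codons after the start codon.

```python
# Hypothetical helpers for cross-checking the record; names are illustrative only.

BASES = "TCAG"
# Standard codon assignments in TCAG order (first, second, third base).
# Translation table 11 uses these same assignments after the start codon.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def clean_sequence(text):
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(text.split()).upper()


def gene_length(start_bp, end_bp):
    """Length in bp of a feature given inclusive 1-based coordinates."""
    return end_bp - start_bp + 1


def gc_fraction(text):
    """Fraction of G and C bases in a non-empty sequence."""
    seq = clean_sequence(text)
    return (seq.count("G") + seq.count("C")) / len(seq)


def translate(text):
    """Translate codon by codon, stopping at the first stop codon.

    Assumes only A, C, G, T are present; any other symbol raises KeyError.
    """
    seq = clean_sequence(text)
    protein = []
    for pos in range(0, len(seq) - 2, 3):
        amino_acid = CODON_TABLE[seq[pos:pos + 3]]
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)


if __name__ == "__main__":
    # First line of the gene sequence from the record above.
    first_line = "ATGAATAGTA TTGAATATCT ACAAGCTGGA GAACAAGAGC TATATCCCAT ATTTTCGGAA"
    length_bp = gene_length(2174746, 2175996)
    print(length_bp)            # 1251
    print(length_bp // 3 - 1)   # 416 (codons minus the stop codon)
    print(translate(first_line))  # MNSIEYLQAG EQELYPIFSE without the space
```

Passing the full gene sequence to translate and gc_fraction would allow a comparison with the listed protein sequence and the reported GC content of 43%.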