Gene Tery_3849 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3849 
Symbol 
ID4242300 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5946968 
End bp5948986 
Gene Length2019 bp 
Protein Length672 aa 
Translation table11 
GC content39% 
IMG OID638108781 
Productcarbonate dehydratase 
Protein accessionYP_723364 
Protein GI113477303 
COG category[R] General function prediction only 
COG ID[COG0663] Carbonic anhydrases/acetyltransferases, isoleucine patch superfamily 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.226074 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.037707 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCACACC GCAAACAACC AGCTCCACCC ACCCCCTGGT CAAAAAACTT GGCACAGCCG 
AAGATTGATG ACACAGCTTA TATTCACTCC TTTTCCAATA TCATTGGAGA TGTCCGTGTT
GGAGCTAATG TATTAGTAGC TCCGGGCACA TCAATTCGTG CAGATGAAGG TACTCCCTTC
TTTATTGGAG CAGGAACTAA TATCCAAGAT GGAGTAGTTA TACATGGTTT AGAACAAGGA
CGAGTTATAG GAGATGACCA ACAAAACTAT TCTGTATGGA TAGGTACAAA TGTTTCTATC
ACTCATAAAG CTTTAGTTCA TGGTCCTTGC TATATCGGTG ATGACTGTTT TATCGGCTTT
CGTTCTACAG TTTTTAACTC CCGCATTGGT GAAGGATGTA TAGTTATGCT TCACGCTCTG
ATCCAGGACG TGGAAATTCC TCCCGGTAAG TATGTGCCTT CAGGAGCAAT TATTACAAAT
CAACAGCAAG TAAATCGTTT GTCAGATGTA CTACCCGATG ATATAAAATT TGCCCATCAT
GTAGTGGGAA TTAACGAATC TTTACGACAA GGTTATCTGT GTGCGAATAA TATATCTTGC
ATTACCCCTA TTAGAAATGA AATGAATATT AATTATAAAA ATGGTAACGG TTACAACCCT
TCAGGAACAA CTGGTAGACT AACCCCAGAA GTAGTTGCTC ATGTAAACCA GTTAGTATCC
CAAGGATATT ATGTTGGTAC AGAACACGCT GACACCCGTC ACTTCAAAAC AGGTTCCTGG
AAAACTTGTT CTCCAATTCA AAGTAGTCAC TCTTCAGAAG TAGTAGCAGC TCTAGAAGCT
TGTATACAAG AACATTCTAC AGAGTATGTG CGGATGTTTG GTATAGACCC TAAAGCTAAA
CGTCGTATAT CTCCAATTAT GATTCAACGT CCTGATGGTA AAAAAGTTGC TCAAAAATCA
ACGACTGGTA ACTACAGTGT TCCTGCTGCT ACTGGTACTA CTAGGGTTGG AAGTACTACT
ACCCCCAATA CTACAGGTCT AACTCCAGAA GTAGTAACCC AAGTTAATTC TTTGCTGTCT
CAAGGATACA AGATTGGTAC GGAGTATGCC AATGAACGTC GTTTTAAAAC TAGCTCTTGG
CAAAACGGTC CGACTATTTC TGAGACTAAT TCTGCACAAG TTTTGGCTGC TCTAGAAAAA
TTTTTAGCAG AACACAGTGG TGAATATGTA CGTTTAATTG GTATAGACTC TAAAGTTAAA
CGTCGTGTTG CAGAAATAAT AATTCAACGA CCAGGCGATA GCCCAATTCA ACAATGTGTA
TCTACTTCTC CAAGTTATCA AGCTCCTGTA TCTACTCATG CAGGAATTAA TACTCGATTA
AGTCAGGAAG TTGTAGAGCA AGTACGTTCA TTATTTAATC AAGGATATAG AATTAGCTTA
GAACACGCTA ATGAACGTCG CTTTAAAACT AGCTCCTGGA TAAGTTGCGC TCCCATTTCT
GCTACTAACC ATTCTCAAGC AATAGCTGAA TTAGAACAAG TTTTAGCAGA ATATAATGGG
GAATATGTAC GTTTAATTGG TATCGATACT CAAGCTAAAC GTCGGGTCAT GGAAAGTTTG
ATTCAACAAC CCAATGGTAA AGGTGAAAGA TCTGCTTCTC TTAAGGCTAC TTCTAATGGA
GTAGTCAATA CTACTCAACA ATCTCCTGTT TCTAGTAGTC AAGTGGCTAC AACAATAGCC
CATAAATTAA GTCAAGAGGC TGTGGAAGAA ATTCGTTCTT TGATTGCAGG TGGTTATAAA
ATTGGTACAG AATATGCTGA TAAACGTCGT TTTAAAACTA GCTCTTGGAA AACAGATATT
CAAATAGATG GTAAACGAGA GGCTGATGTT TTTCCAGTGC TTGAAGAAAG TCTGGCTCAC
CATGAGGGAG AATATGTCCG CTTGATAGGT ATAGATCCAA AAGCGAAACG CCGAGTCTTA
GAAAAGATTA TTCAACAACC TAACGGTAAG GCTAACTGA
 
Protein sequence
MPHRKQPAPP TPWSKNLAQP KIDDTAYIHS FSNIIGDVRV GANVLVAPGT SIRADEGTPF 
FIGAGTNIQD GVVIHGLEQG RVIGDDQQNY SVWIGTNVSI THKALVHGPC YIGDDCFIGF
RSTVFNSRIG EGCIVMLHAL IQDVEIPPGK YVPSGAIITN QQQVNRLSDV LPDDIKFAHH
VVGINESLRQ GYLCANNISC ITPIRNEMNI NYKNGNGYNP SGTTGRLTPE VVAHVNQLVS
QGYYVGTEHA DTRHFKTGSW KTCSPIQSSH SSEVVAALEA CIQEHSTEYV RMFGIDPKAK
RRISPIMIQR PDGKKVAQKS TTGNYSVPAA TGTTRVGSTT TPNTTGLTPE VVTQVNSLLS
QGYKIGTEYA NERRFKTSSW QNGPTISETN SAQVLAALEK FLAEHSGEYV RLIGIDSKVK
RRVAEIIIQR PGDSPIQQCV STSPSYQAPV STHAGINTRL SQEVVEQVRS LFNQGYRISL
EHANERRFKT SSWISCAPIS ATNHSQAIAE LEQVLAEYNG EYVRLIGIDT QAKRRVMESL
IQQPNGKGER SASLKATSNG VVNTTQQSPV SSSQVATTIA HKLSQEAVEE IRSLIAGGYK
IGTEYADKRR FKTSSWKTDI QIDGKREADV FPVLEESLAH HEGEYVRLIG IDPKAKRRVL
EKIIQQPNGK AN