Gene Tery_2034 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2034 
Symbol 
ID4243638 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp3165641 
End bp3169732 
Gene Length4092 bp 
Protein Length1363 aa 
Translation table11 
GC content46% 
IMG OID638107148 
ProductRTX toxins and related Ca2+-binding protein 
Protein accessionYP_721751 
Protein GI113475690 
COG category[R] General function prediction only 
COG ID[COG5271] AAA ATPase containing von Willebrand factor type A (vWA) domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.055815 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.277467 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGCTAAGT TTATTTCTGA TACAGAAAGT AGTGCTCATG GTGAAGAAGG TAATGACCTA 
GAACCTAAAG ACCTAAGTAT AGATACTGAT AAGACAGTCG ACAGTGATGA AGGGGACGAC
GATAGCAAAG AAGAAACGAG TGCTGAAGGC AATGAACCAG CCCCCGCTGT AGAAGAAAGC
GACACTCAGG AAGGAGAAGA AGGAAACAAT ACCGACTCTG GTGAAGATGC TGATGAGACA
GTCGACAGTG ATGAAGGAGA TGACGACAGC GAAGAAGAAA CGAGTGCTGA AGGCAATGAC
CCAACCCTCG CTGCTGAAGA AAGTGACACT CAGGAAGGAG AAGAAGGAAA CGATACCGAC
TCTGGTGCTG GTGAAGATGC TGATGAGACA GTCGACAGTG ATGAAGGGGA CGACGATAGC
AAAGAAGAAA CGAGTGCTGA AGGCAATGAA CCAGCCCCCG CTGTAGAAGA AAGCAACACC
GAAGAAGGAG AAGAAGGAAA CAATACCGGC TCTGGTGAAG ATGCTGATGA GACAGTTAAC
AGTGATGAAG GGGACGACGA TAGCAAAGAA GAAATGAGCG CTGACGGTAA TGACCCAGCC
CCGGCTACAG AAGAAAGCGA TACCCAGAAA GGAGAAGGAG AGAATGAGCG CCTCATTATT
TCTGGAACTG AAGGTAATGA TAGACTGCTT GGTGTTGAAG TTAATGAAGA GATCAAAGGA
AAAGCCGGCG ATGATAAACT TTTCGGTGGT GGTGGAGATG ACCTAGTCCT AGGCGGAGAA
GGAAAAGACT TCCTCCGAGG AGATGGAGGA CATGATACCC TCTCTGGTGG AGAAGGTGAT
GATATCATTG ATGGTGGTGA TGGAGATGAC GACATCAAAG GAGAAGCAGG TGATGATAAA
CTTTTTGGTG GTAAAGGTAA TGACTTAGTC TCGGGTAAAG AAGGAAATGA CTTGCTCCGA
GGAGATGGAG GAACTGATAC CCTCACAGGT GGAGAGGGTG ATGACATCAT TAACGGTGGT
GCGGATAACG ACGATATCAA AGGAGAAGCA GGTGATGATA AACTTTTCGG TGGTGATGGA
GATGACCTAG TCTCTGGTGC AGAAGGAAAC GATCTACTGA AAGGAGATGG AGGAACTGAT
ACCCTCACAG GTGGAGAGGG TGATGACATC ATTGCTGGTG GTGATGGAGA TGACGAAATC
AAAGGAGAAG CCGATAATGA TAAACTTTTT GCTGGTGATG GAGATGACCT AGTCTCTGGT
GGAGAAGGAA ACGATCTACT GAAAGGAGAT GGAGGAACTG ATACCCTCAC AGGTGGAGAG
GGTGATGACA TCATTGCTGG TGGTGATGGA GATGACGACA TCAAAGGAGA AGCCGATAAT
GATAAACTTT TTGCTGGTGA TGGAGATGAC CTAGTCTCGG GTGGAGAAGG AAACGATCTA
CTGACAGGAG ATGGAGGAAC TGATACCCTC ACAGGTGGAG AGGGTGATGA CATCATTAAC
GGTGGTGCGG ATAACGACGA TATCAAAGGA GAAGCAGGTG ATGATAAACT TTTCGGTGGT
GATGGAGATG ACCTAGTCTC TGGTGCAGAA GGAAACGATC TACTGAAAGG AGATGGAGGA
ACTGATACCC TCACAGGTGG AGAGGGTGAT GACATCATTG CTGGTGGTGA TGGGGATGAT
GACATCAAAG GAGAAGCCGA TAATGATAAA CTTTTTGCTG GTGATGGAGA TGACCTAGTC
TCGGGTGGAG AAGGAAACGA TCTACTGACA GGAGATGGAG GAACTGATAC CCTCACAGGT
GGAGAGGGTG ATGACATCAT TAACGGTGGT GCGGATAACG ACGATATCAA AGGAGAAGCA
GGTGATGATA AACTTTTCGG TGGTGATGGA GATGACCTAG TCTCTGGTGC AGAAGGAAAC
GATCTACTGA AAGGAGATGG AGGAACTGAT ACCCTCACAG GTGGAGAGGG TGATGACATC
ATTGCTGGTG GTGATGGGGA TGATGACATC AAAGGAGAAG CCGATAATGA TAAACTTTTT
GCTGGTGATG GAGATGACCT AGTCTCGGGT GGAGAAGGAA ACGATCTACT GACAGGAGAT
GGAGGAACTG ATACCCTCAC AGGTGGAGAG GGTGATGACA TCATTAACGG TGGTGCGGAT
AACGACGATA TCAAAGGAGA AGCAGGTGAT GATAAACTTT TCGGTGGTGA TGGAGATGAC
CTAGTCTCTG GTGCAGAAGG AAACGATCTA CTGAAAGGAG ATGGAGGAAC TGATACCCTC
ACAGGTGGAG AGGGTAATGA CATCATTGCT GGTGGTGATG GGGATGATGA CATCAAAGGA
GAAGCCGATG ATGATAAACT TTTCGGTGGT AAAGGTAATG ACTTAGTTTC GGGTGGAGAA
GGAAACGATT TACTCCGAGG AGATGGAGGA AACGATACCC TCTCTGGTGG AGAGGGTGAT
GATATCATTG CTGGTGGTGA AGGTAATGAC GAAATCAAAG GAGAAACCGG TAATAATCAA
CTTTTTGCTG GTGAAGGTAA TGACCTAGTC TCAAGTGCAG AAGGAAACGA TTTACTCCGA
GGAGATGGAG GAGCTGATAC CCTCACCGCT GGAGAAGGTG ATGATACCAT TGACGGTGGT
GCGGATAACG ACGATATCAA AGGAGAAGCA GGTGATGATA AACTTTTCGG TGGTGATGGA
GATGACACAG TCTTAGGTGC AGAAGGAAAC GATTTACTCC GAGGAGATGG AGGAAACGAT
ACCCTCTCTG GTGGAGAGGG TGATGATATC ATTGCTGGTG GTGATGGAGA CGACGAAATC
AAAGGAGAAA CCGGTAATAA TAAACTTTTT GCTGGTGAAG GCAATGACCT AGTCTCAAGT
GCAGAAGGAA ACGATTTACT CCGAGGAGAT GGAGGAGCTG ATACCCTCAC CGCTGGAGAA
GGTGATGATA CTATTGACGG TGGTGCGGAC AACGACGATA TCAAAGGAGA AGCAGGTGAT
GATCAACTTT TTGGTGGTGA TGGAGATGAC ATAGTCTTAG GTGCAGAAGG AAACGATTTA
CTCCGAGGAG ATGGAGGAAA CGATACCCTC ACTGGTGGAG AGGGTGATGA CATCATTGCT
GGTGGTGAAG GTAATGACGA AATCAAAGGA GAAACCGGTA ATAATCAACT TTTTGCTGGT
AAAGGTAATG ACCTAGTCTC AAGTGCAGAA GGAAACGATT TACTCCGAGG AGATGAAGGA
AATGATACTC TCACTGCTGG AGGTGGTGAT GATAAACTTT TTGGTGGTGA TGGGGATGAT
GAGCTCACAG GAGAAGCAGG TGATGATCAA CTTTTTGCTG CTGAAGGTAA TGACCTAATT
TCAGGTGGAG AAGGAAACGA TCTGCTGAAA GGAGAAGGAG GAAATGATAC CCTCTCTGGT
GGTGAAGGTG ATGATACAAT TTTTGGTTGT CATGGGAGTG ATGAGATCAA AGGAGATGCA
GGTGATGACC TCATCATTAG TTATAGTGAT GCAGGAGAAC CAGATATTGC CCAAGATACA
GATCAACCTA AGGTTTATTC CGAGCAACCC TTTCTTGAAG CTCATGATAC TTTAACTGGA
GGAACAGGGG CTGATACTTT TGAGTTTAAA CTGTTAATTA ATGCCAAAGA CAATATTATT
CAAAAACACG CCGATCCTGT AACTGGTAAA ATCAACTGGC AAGGAGTCGC TGGCGAAAAT
GATAACCCCC ATGATCACTG GGTTGATGCT ATTGGTAATG ATGTCATCCT TGACTTTAAT
AAAAGTGAAG GGGATAAAAT TCAGATTTTA GGTCATACTG TACAAGTCAG GGAGATTGAA
AATCTTGACG ATGGAAATGG ATCTATTATT CACTTAATTA GTAATCAAGG TGGTAATGGT
GGAGCCCACG ACCAAGATGA ACTAGGAACA ATTACTGTTT ATGGTGATCT AGTTGAACAG
TCGGACTTAA CTGTCCGCGC TGGAGTTACC TTTGGGGTTC TTCACTCCAT CCCGATCAGT
GAGATTAATG CTGAACAAAT AGCAACGCCT ATGTCGAACT CTACGCATCT GGGTGACTGT
GGGTGTTGCT GA
 
Protein sequence
MAKFISDTES SAHGEEGNDL EPKDLSIDTD KTVDSDEGDD DSKEETSAEG NEPAPAVEES 
DTQEGEEGNN TDSGEDADET VDSDEGDDDS EEETSAEGND PTLAAEESDT QEGEEGNDTD
SGAGEDADET VDSDEGDDDS KEETSAEGNE PAPAVEESNT EEGEEGNNTG SGEDADETVN
SDEGDDDSKE EMSADGNDPA PATEESDTQK GEGENERLII SGTEGNDRLL GVEVNEEIKG
KAGDDKLFGG GGDDLVLGGE GKDFLRGDGG HDTLSGGEGD DIIDGGDGDD DIKGEAGDDK
LFGGKGNDLV SGKEGNDLLR GDGGTDTLTG GEGDDIINGG ADNDDIKGEA GDDKLFGGDG
DDLVSGAEGN DLLKGDGGTD TLTGGEGDDI IAGGDGDDEI KGEADNDKLF AGDGDDLVSG
GEGNDLLKGD GGTDTLTGGE GDDIIAGGDG DDDIKGEADN DKLFAGDGDD LVSGGEGNDL
LTGDGGTDTL TGGEGDDIIN GGADNDDIKG EAGDDKLFGG DGDDLVSGAE GNDLLKGDGG
TDTLTGGEGD DIIAGGDGDD DIKGEADNDK LFAGDGDDLV SGGEGNDLLT GDGGTDTLTG
GEGDDIINGG ADNDDIKGEA GDDKLFGGDG DDLVSGAEGN DLLKGDGGTD TLTGGEGDDI
IAGGDGDDDI KGEADNDKLF AGDGDDLVSG GEGNDLLTGD GGTDTLTGGE GDDIINGGAD
NDDIKGEAGD DKLFGGDGDD LVSGAEGNDL LKGDGGTDTL TGGEGNDIIA GGDGDDDIKG
EADDDKLFGG KGNDLVSGGE GNDLLRGDGG NDTLSGGEGD DIIAGGEGND EIKGETGNNQ
LFAGEGNDLV SSAEGNDLLR GDGGADTLTA GEGDDTIDGG ADNDDIKGEA GDDKLFGGDG
DDTVLGAEGN DLLRGDGGND TLSGGEGDDI IAGGDGDDEI KGETGNNKLF AGEGNDLVSS
AEGNDLLRGD GGADTLTAGE GDDTIDGGAD NDDIKGEAGD DQLFGGDGDD IVLGAEGNDL
LRGDGGNDTL TGGEGDDIIA GGEGNDEIKG ETGNNQLFAG KGNDLVSSAE GNDLLRGDEG
NDTLTAGGGD DKLFGGDGDD ELTGEAGDDQ LFAAEGNDLI SGGEGNDLLK GEGGNDTLSG
GEGDDTIFGC HGSDEIKGDA GDDLIISYSD AGEPDIAQDT DQPKVYSEQP FLEAHDTLTG
GTGADTFEFK LLINAKDNII QKHADPVTGK INWQGVAGEN DNPHDHWVDA IGNDVILDFN
KSEGDKIQIL GHTVQVREIE NLDDGNGSII HLISNQGGNG GAHDQDELGT ITVYGDLVEQ
SDLTVRAGVT FGVLHSIPIS EINAEQIATP MSNSTHLGDC GCC