Gene Tery_3664 details


Gene Information

Locus tag: Tery_3664
Symbol:
ID: 4243970
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 5623544
End bp: 5625580
Gene Length: 2037 bp
Protein Length: 678 aa
Translation table: 11
GC content: 42%
IMG OID: 638108611
Product: hemolysin-type calcium-binding region
Protein accession: YP_723199
Protein GI: 113477138
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2931] RTX toxins and related Ca2+-binding proteins
TIGRFAM ID:
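
The coordinates and lengths listed above are mutually consistent, which a short check can confirm. This is only a sketch: it assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the terminal stop codon. The function names are illustrative and do not come from the database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count for a CDS whose final codon is a stop codon (assumed)."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # drop the stop codon


length = gene_length(5623544, 5625580)
print(length)                  # 2037
print(protein_length(length))  # 678
```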


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.157401
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.209171
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGAGTCAC CCTTACAACA ACAGCCAATT CCGAATCATG GAAAATTTAA CCATATACAT 
CCTATCCTCA ACCCTGAAGT CCCAGTTGTC ACACCTGAAA ATCCTTTGCC TACTATTTCA
CGACTATGGG ATGAACTAGC CCAAGAAGCA GTCATAAACT CCGTCTCCGG TCCAACCATT
GCCTCTCGTG CTTATGCAAT GGTACATACA GCCATGTATG ATGCCTGGAG TGCCTATGAT
CCATTAGCTA TAGGAACTCA GTTGGGAGAT GACTTACAAC GCCCCCTATC CGAAAACACC
AACTTTAATA AAACTCAAGC CATGAGTTTT GCTGCTTACC GGGTATTGTC AGAATTATTC
CCCGCTCAAG TAGAAACTTT CAATGAATTA ATGGTCAAAT TAGACCTAGA AACCAACAAT
ACTACTACTG ATACTACTAC CCCTGCTGGT ATTGGTAACG TCTCTGCCGA AGCACTATTA
AGATTCCGGT CCAACGACGG TTCAAATCAA CTCGGGGATG ATCCCAATGG CGACGGTACC
CCCTATTCTG ATATTAGCAG CTATGAACCT ATTAATTTTC CAGGAAGTTC CATTAATATA
GAGCGCTGGA CCCCAGAACC AGTTCCCATT GATGCCGAGC CCGGTGAAGA AGATCGAATA
CAAACATTCC TCACACCTCA ATGGGGAAAA GTCATCCCCT TCAGTTCTAT ATCAGTTGAA
AAAATACGAC CCCAACCACC AAAACCATTC TTGCTTGTAG ATGGCGAGGT TGACTTAGAT
GCAAGTACAA TTACATTACC AGATGGCTCT GTTGTCGCAA TTAGCAAAGA TATTGTTGGA
ACTATTATCA ACCCTGAATT TATCGCACAA GCAGAAAATA TTGCCAACGT TCGCGCCAAC
TTAACTGATG AGCAAAAATT AATTGCTGAA TTTTGGGAAG ACGGCCATGG TACATCTTAC
CCTCCGGGAA CCTGGATGAA TTTTGGAGAA TTTATCTCAG CCAGAGATAA TCATACTTTA
GATGAAGATG TGAAACTCTT CTTTAGTTTG GGCAATGCAT TATTTGATGC TGGCATTGCT
GCCTGGGGAT CTAAAGTTTT CTACGACTAT GCTCGTCCCG TGCGAGTAGT ACGGGAACTC
GGAGCACAAG GTTTAATTGG AGAATTTAAT CCAGAGCTTG GTGGTTTTGC TATTGATAGT
TGGAAGGATC CTACAGAAGG AACTGCAACT ATTCTTGCAA CTAATTTCCT TACTTATCAA
GCCCCTGGAG AGGAGCCTTC ACCTCCTTTT GCTGAGTATA TATCAGGTCA TAGCAGTTTC
AGTGCTGCAG GAGCTGAAAT TCTCAAACGG TTTACTGGTA GCGATGATTT TGGTGGTGGC
ATAACTTTTG AGGTAGGCGA GTCTGTTTTT GAACCAGGTA TCACACCTAA AGTGCCTGTA
ACTCTTGAGT GGAATACTTT TAGCGAAGCT TCTGATCAGG CTGGTATATC TCGTATTTAC
GGCGGTATCC ACTTTGAAGA TGGGGATCTC AATGGGAGGG CACTAGGGCG AGAAGTCGCA
GAACATGTTT GGGAAAAAAC TCAAGGAGTA ATTACTCCCA ATACTATTAT TGCTACTAAT
GAAAAGGATA ATTTAATTGG CTCTGTCACT AACGACTTAA TATATAGCAA CCGCAGTGAT
GATATTGTTT TTGGTAACGA GGGCAATGAT ATGCTTTGTG GAGGCAAGGG TAACGACATC
GTCAATGGTG GTGCAGGAGC AGACCTGATA TACGGTGATT TTGGCGATGA TATTCTCATT
GGTGGAGTTG GTGGCGATAA CTTTCATTTC AGGTCTAATG ATGGGAACAA TATTATTACT
GATTTTGAAG ATGGAATAGA TGTTATTGGT TTAGGTGATG GTTTAAGTTT CCAGCAGTTA
ACTATTTCTC AAATAGGTAA TGATACTCGA ATTAGTGCCA ACCAACTTTC AATTACATTG
CAGGGCGTCG AAGAAAGTGC CATAAATATT GAAGATTTTA GGGATTGTAA AATATAA
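
A GC content figure such as the one in the Gene Information table can be recomputed from a sequence printed in this blocked layout. The sketch below assumes GC content means the fraction of G and C bases among all bases, and ignores the spaces and line breaks used for display. The helper name `gc_content` is hypothetical.

```python
def gc_content(seq: str) -> float:
    """Fraction of G/C bases; whitespace used for display is ignored."""
    bases = "".join(seq.split()).upper()
    if not bases:
        raise ValueError("empty sequence")
    gc = sum(1 for b in bases if b in "GC")
    return gc / len(bases)


# First 10-base block of the gene sequence above.
print(f"{gc_content('ATGGAGTCAC') * 100:.1f}%")  # 50.0%
```

Passing the full gene sequence, pasted as a single string, gives the value for the whole gene.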
 
Protein sequence
MESPLQQQPI PNHGKFNHIH PILNPEVPVV TPENPLPTIS RLWDELAQEA VINSVSGPTI 
ASRAYAMVHT AMYDAWSAYD PLAIGTQLGD DLQRPLSENT NFNKTQAMSF AAYRVLSELF
PAQVETFNEL MVKLDLETNN TTTDTTTPAG IGNVSAEALL RFRSNDGSNQ LGDDPNGDGT
PYSDISSYEP INFPGSSINI ERWTPEPVPI DAEPGEEDRI QTFLTPQWGK VIPFSSISVE
KIRPQPPKPF LLVDGEVDLD ASTITLPDGS VVAISKDIVG TIINPEFIAQ AENIANVRAN
LTDEQKLIAE FWEDGHGTSY PPGTWMNFGE FISARDNHTL DEDVKLFFSL GNALFDAGIA
AWGSKVFYDY ARPVRVVREL GAQGLIGEFN PELGGFAIDS WKDPTEGTAT ILATNFLTYQ
APGEEPSPPF AEYISGHSSF SAAGAEILKR FTGSDDFGGG ITFEVGESVF EPGITPKVPV
TLEWNTFSEA SDQAGISRIY GGIHFEDGDL NGRALGREVA EHVWEKTQGV ITPNTIIATN
EKDNLIGSVT NDLIYSNRSD DIVFGNEGND MLCGGKGNDI VNGGAGADLI YGDFGDDILI
GGVGGDNFHF RSNDGNNIIT DFEDGIDVIG LGDGLSFQQL TISQIGNDTR ISANQLSITL
QGVEESAINI EDFRDCKI
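
The protein sequence can be reproduced from the gene sequence with a codon-by-codon translation. The sketch below builds the codon table from the NCBI ordering of bases (T, C, A, G); translation table 11 assigns the same amino acids to codons as the standard table and differs only in which codons may act as starts. Alternative start codons are not handled here, which is sufficient for this gene because it begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
BASES = "TCAG"
# Amino acids for the 64 codons in NCBI order (TTT, TTC, TTA, TTG, TCT, ...).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = dict(
    zip((a + b + c for a in BASES for b in BASES for c in BASES), AMINO)
)


def translate(seq: str) -> str:
    """Translate a CDS in frame 1, stopping at the first stop codon."""
    dna = "".join(seq.split()).upper()  # drop display spacing
    protein = []
    for i in range(0, len(dna) - 2, 3):
        residue = CODON_TABLE[dna[i:i + 3]]
        if residue == "*":  # stop codon
            break
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate("ATGGAGTCAC CCTTACAACA ACAGCCAATT"))  # MESPLQQQPI
```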