Gene Tery_1355 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1355 
Symbol 
ID4241852 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2073515 
End bp2075386 
Gene Length1872 bp 
Protein Length623 aa 
Translation table11 
GC content41% 
IMG OID638106530 
Producthemolysin-type calcium-binding region 
Protein accessionYP_721141 
Protein GI113475080 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.530325 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones10 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGACAT TAAGTGACGC AGACAATAAC TTTGATCAGG ACCAATCAGA CCAGATGAAA 
CCTGATACCA TTAATGCTAT TGGTGGAGAT GACATAATAT TAAGTTCTAC CCTGGGAGGT
AGTCTAATTA GAGGTGGAAA TGGAAACGAT ACTTTGCTGT CCAGAGGGCC TGCTGGAATT
GGCAACTCTG GTGAAGGTGA CACTCTTGAA GGGGGGAATG GAGATGATAC CTTAGACTTC
AGAGCAGCAG GAGGCACTGG ATTTGGAGAT GAGGGCAATG ATACCTTAAA AGCTATTACC
CAAACTACCC TGTATGGAGG TAATGGGGAG GACTCTCTCA TAAGTTTGAT TGAAGGAAAC
TATCTATCAG GCAATCGAGA TAATGATAGA TTCTTCATAA CAGAGAGAGA TACTGTTTAT
GGAGGTAAAG GTAATGATAC TATTCTTAAT AGGGATACCT CAACTCCATC AGAAAATTCT
GGGTTTAATT TTATATCTGC AAACAATAAC GAAGATTATG TCAAGGTGTC TGGTGATCGC
GATACTATCT ATGGTGGTAA GGAGGACGAT ACACTAATTT CTTCGGGTAG TCGAGTCTTT
ATAAGTGGAG ATAAAGGTAA TGACACAATC ACTAACTCGG AAGAAAGGTC TACCCTACTT
GGTGGTGAAG GTGATGATTC GATTATGGGC GGGCTTGATC CATCTCCTAG TGATGAGAAT
AATAATAGTA GAAATAGTCT AGAGGGTGGT GCGGGTAGCG ATACTCTCAT TGGTCGAGGT
GCGAGGGACA CTTTGATTGG TGGTGAAGGT AATGACTCTA TTGTAAGCAA GTCTACTGAT
GAAGGTTCTG ATGGTCAAAA CAGATTGGAT GGTGGTGCAG GTAGCGATAC CTTAGTAGGG
GGATATGTAA CTGATACAAT GATCGGGGGT GATGGTAATG ACTCCCTTAG TGGTATATTC
ACAAAGGGAG ATGGTGGTGG AGGTAATGAC ACCATAAATG CTACTGCTGC AGGTACTGAT
GCTGCTCAAA TTACCCTTAT TGGTGGTGCA GGTAATGACA GTTTACTAGG TAATACTAGT
GCTGGTGTAT CTAACTTCTT CGATGGTGGT ACAGGCAATG ACTTTATCCA GTTTGGATCA
ACTAACGATC AGTTAATTGG GAACAACAGA GGTAACGATA CTATTGCTGC GGTTGGAACT
AGTACATCAG GATTTAATAT TCAAGATACC TTCGGCAATA ATACTATTAC TGGTGGTAAA
GGAAATGACA CACTAGTTAC TGGAGGAGGT AATGACTCTA TTACTGGTGG TAACTCAGTC
AGTGAAGATT CCCCAGACGA AGGAGATGAT CGTATTGTTG CTGGTGCAGG GAATGACACA
ATATTCGGAC GTGGTGGCAA AGATACAATT ATTGGTGGTG ATGGTGATGA CTATATTGTA
GGAGGTCTTG ACAGTACAGG CGATTCATTA ATTGGTGGTG CAGGAAACGA TAGTTTCGTT
TATTTTGGTT TGGATGGTAA GGACCAAAGT CCAAATAGTG TTATTGTAGA TTTCGATACT
GCTGACGACA AAATACTGCT TCGAGATCAA GGATTTAATT TAGCCAACTC TGAGGCAGGT
TCAACAATTG CTAATGCTGA TTTGGTTCTT ATTGACCCAG GCAATAACTA TAGCAACGAT
GATGCTCGAA GTACTAATCC TACAATTATC TACGAGCCTG AGCCTCTAAA CGGAAATGAA
AGAATAGATT CTGGTTTATT AAAGTACGAT CCTAGTGGTA GTGGTGGTCC AGATGATAAC
AGCGATGTTG TTACTATTGC TCGTCTCAAC GGTAATCCAG GTCTTGAGAA CAGCGATATT
TTAATTATCT AA
 
Protein sequence
MVTLSDADNN FDQDQSDQMK PDTINAIGGD DIILSSTLGG SLIRGGNGND TLLSRGPAGI 
GNSGEGDTLE GGNGDDTLDF RAAGGTGFGD EGNDTLKAIT QTTLYGGNGE DSLISLIEGN
YLSGNRDNDR FFITERDTVY GGKGNDTILN RDTSTPSENS GFNFISANNN EDYVKVSGDR
DTIYGGKEDD TLISSGSRVF ISGDKGNDTI TNSEERSTLL GGEGDDSIMG GLDPSPSDEN
NNSRNSLEGG AGSDTLIGRG ARDTLIGGEG NDSIVSKSTD EGSDGQNRLD GGAGSDTLVG
GYVTDTMIGG DGNDSLSGIF TKGDGGGGND TINATAAGTD AAQITLIGGA GNDSLLGNTS
AGVSNFFDGG TGNDFIQFGS TNDQLIGNNR GNDTIAAVGT STSGFNIQDT FGNNTITGGK
GNDTLVTGGG NDSITGGNSV SEDSPDEGDD RIVAGAGNDT IFGRGGKDTI IGGDGDDYIV
GGLDSTGDSL IGGAGNDSFV YFGLDGKDQS PNSVIVDFDT ADDKILLRDQ GFNLANSEAG
STIANADLVL IDPGNNYSND DARSTNPTII YEPEPLNGNE RIDSGLLKYD PSGSGGPDDN
SDVVTIARLN GNPGLENSDI LII