Gene Tery_1884 details

Gene Information

Locus tag: Tery_1884
Symbol:
ID: 4242689
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 2878439
End bp: 2880205
Gene Length: 1767 bp
Protein Length: 588 aa
Translation table: 11
GC content: 40%
IMG OID: 638107005
Product: peptidase S1 and S6, chymotrypsin/Hap
Protein accession: YP_721613
Protein GI: 113475552
COG category: [Q] Secondary metabolites biosynthesis, transport and catabolism
COG ID: [COG2931] RTX toxins and related Ca2+-binding proteins
TIGRFAM ID:
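
The coordinates and lengths in this record can be cross-checked with simple arithmetic. The sketch below is illustrative only and is not part of the database record. It assumes that Start bp and End bp are 1-based inclusive coordinates and that the gene ends with a single stop codon that is not counted in the protein length. The function names are hypothetical.

```python
# Values copied from the Gene Information record.
START_BP = 2878439
END_BP = 2880205
GENE_LENGTH_BP = 1767
PROTEIN_LENGTH_AA = 588


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_codons(gene_length_bp: int) -> int:
    """Number of amino-acid codons, assuming one terminal stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("gene length is not a multiple of 3")
    return gene_length_bp // 3 - 1


if __name__ == "__main__":
    print(span_length(START_BP, END_BP) == GENE_LENGTH_BP)
    print(coding_codons(GENE_LENGTH_BP) == PROTEIN_LENGTH_AA)
```

Under these assumptions both checks agree with the record: 2880205 - 2878439 + 1 = 1767 bp, and 1767 / 3 - 1 = 588 aa.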


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.0760067
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGTAGCAG TTTCTAGCAA TAACTATAAC GATGAAAAAT TTATCCCCTC TTTTGATGAA 
TTTACTGGAG TTGTCTATTT TGAAAATATT AAGACGCAAG TAAGTTGCAC AGGGGCTTTA
CTAGATAGTA ATGGCCTTTA TATTTTGACG GCTGCCCATT GCTTTAATAA GCAGAATGAT
TCGGCAAACT TAAATCCTAA CCCTAATAAT TATAAAGTCT TTTTTGAAGT TAATGGTACT
CTTAAATCAA GACTTGTCGA AGAGATTTTT GTTCATCCTG AGTGGACATC TGATGAAAAT
AGTAATAACG ATATTGCTAT CATTAAACTC TCTAATGAAG CGCCTGATGT TGAGAGCTAC
GATATATATC GTGATACTGA TGAGGTGGAT CAGGTTTTTA CCCGTGTGGG TTACGGTTTC
CCTGGAACTG GTAGAGACGG TCAGATTGAT GATTCAGGAG AAGACCCCCC TGTCAAGCGT
TTTGGACAAA ATTATTATGA TGCCTTGGGT GAGATTTTCA ATGATTATGA TTATGATACT
CCAATAATAA AAGGTACACA GCTAGCCTTC GACTTTGATA ATGGGAAACC AAGGAGGGAT
GCTTTTGGTC GTGAATACGG TTTGGATCAG TTGGGAGTGG ATCAAGAGGT TAATTCTACA
GCAGGAGATT CAGGGGGTCC GGCCTTTATT GATGGGAAAA TAGCTGGGAT CACATCCTAT
GGTTTTTCTT CAGGTATGTA TGATACAGAT GTAGATGATG AAAGTGGTAA TTCTAATTTC
GGAGAATACA GTGTTGATAT CAGGGTTTCA GCTTATATAG AATTTATTAC AGACATTACA
TCCTTATCAC TTGAGGGAAG TAGGGGTGAT GATACTATAA AAGGAGGTAT TGGTAACGAT
ACTATTAATG GAGCTTCTGG TCAAGATAGA CTTTTGGGAA AATCTGGCAA TGACTTCTTA
GTAGGTAGCT TTGGTAATGA TGCTTTGTTA GGGGAAGCAG GTGATGATAT TCTAAAAGGC
GGTGGGGGTC GCGATCGCTT GAACGGTGGT ACTGGCAACG ATACACTTAC TGGTGATGGA
GGTAATGACC GCCTCAACGG TAGTGGAGGT AATGACCGCC TCAACGGTAA TACTGGCAAC
GATATACTTA CTGGTGGTGG AGGTAATGAC CGCCTCAACG GTGGTGGAGG TAATGACCGC
CTCAACGGTA ATACTGGCAA CGATATACTT ACTGGTGGTG GAGGTAATGA CCGCCTCAAT
GGTGGTGGAG GTAATGACCG CCTCAATGGT AATAATGGCA ACGATATACT TACTGGTGGT
GGAGGTAATG ACCGCCTCAA CGGTGGTGGA GGTAATGACC GCCTCAACGG TGGTGCTGGA
AATGATATAC TTATTGGTGG TGGAGGTAAT GACCGTTTTA TCTTTAATAG TAATGAAAAA
TTTGACTCCA ATGATTTTGG TATTGATACT ATCAAAAATT TTGAACCAGA CCTCAGACCT
GATGAAGATG GAAGCCAAGG TGACTTAATT GTTCTTGACA AGTCAAGCTT TACTGACCTT
GGTAGTTCTA CTCGTATTGG TTTCAGTGTT GATAGTGATT TTGAAATCGT TAATACTAAT
AATAGTATAG ATGACTCTAA TGCTTTTATT GTTTACAATG AAGAGTCTGG AAATCTATTC
TACAAATCAG ATGGTGAGTA TACTAAGTTT GCTATTCTTA ATGGTGCGCC CACTATTACT
GAAGATAATT TCCAAATTAT CAATTAG
 
Protein sequence
MVAVSSNNYN DEKFIPSFDE FTGVVYFENI KTQVSCTGAL LDSNGLYILT AAHCFNKQND 
SANLNPNPNN YKVFFEVNGT LKSRLVEEIF VHPEWTSDEN SNNDIAIIKL SNEAPDVESY
DIYRDTDEVD QVFTRVGYGF PGTGRDGQID DSGEDPPVKR FGQNYYDALG EIFNDYDYDT
PIIKGTQLAF DFDNGKPRRD AFGREYGLDQ LGVDQEVNST AGDSGGPAFI DGKIAGITSY
GFSSGMYDTD VDDESGNSNF GEYSVDIRVS AYIEFITDIT SLSLEGSRGD DTIKGGIGND
TINGASGQDR LLGKSGNDFL VGSFGNDALL GEAGDDILKG GGGRDRLNGG TGNDTLTGDG
GNDRLNGSGG NDRLNGNTGN DILTGGGGND RLNGGGGNDR LNGNTGNDIL TGGGGNDRLN
GGGGNDRLNG NNGNDILTGG GGNDRLNGGG GNDRLNGGAG NDILIGGGGN DRFIFNSNEK
FDSNDFGIDT IKNFEPDLRP DEDGSQGDLI VLDKSSFTDL GSSTRIGFSV DSDFEIVNTN
NSIDDSNAFI VYNEESGNLF YKSDGEYTKF AILNGAPTIT EDNFQIIN
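
The relationship between the gene sequence and the protein sequence can be illustrated with a minimal translation sketch. It is not the database's own pipeline. It assumes the sequence is read in frame from its first base, and it uses the amino-acid assignments of translation table 11, which are the same as those of the standard genetic code (table 11 differs in its permitted start codons, which does not matter here because the gene begins with ATG). The names `CODON_TABLE` and `translate` are hypothetical.

```python
from itertools import product

# Codon table in NCBI order: bases T, C, A, G with the third position
# varying fastest. Amino-acid assignments for table 11 match the standard code.
BASES = "TCAG"
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate an in-frame DNA string, stopping at the first stop codon.

    Whitespace (such as the 10-base grouping used above) is ignored.
    Codons containing characters other than A, C, G, T raise KeyError.
    """
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


if __name__ == "__main__":
    # First 30 bases of the gene sequence listed above.
    print(translate("ATGGTAGCAG TTTCTAGCAA TAACTATAAC"))
```

Applied to the first 30 bases of the gene sequence, this yields MVAVSSNNYN, matching the start of the protein sequence. The final 27 bases (GAAGATAATT TCCAAATTAT CAATTAG) yield EDNFQIIN followed by the TAG stop codon.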