Gene Tery_1733 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1733 
Symbol 
ID4245390 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2638093 
End bp2640024 
Gene Length1932 bp 
Protein Length643 aa 
Translation table11 
GC content36% 
IMG OID638106859 
Productpeptidase S1 and S6, chymotrypsin/Hap 
Protein accessionYP_721468 
Protein GI113475407 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.635761 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTTTTAG AATTATCTAT TGTTAGGATA TTTAAATCTG GAGGTGGTGT CGCTGGCTCT 
GGTTTTTTGG TTTCTAATGA GTATATTTTA ACTTGTGCTC ATGTAGTGGC TTATTGTTTA
GATACTCCTA AAAAAACTGC ACATATAATT ATGAGGCAGA AAGAAATTCC TGATGAAATT
ATTGAGGTTA ATTTTCCTAT TTTTGAGAAA GGTAAGATAG GTGAAAAACT TGAGACAAAA
GTGACTTTTT GGCGACCTCT AAATGATCAG GAAAATATAC AAGATATTGC AGTATTAAAA
CTAATAAATT CTGATCTACT TCCCGAAGAT GCTAAACCAA TTAATTTAAT TCAAATAGGA
AATCAATCTC TTAAGGAAAA TGAGTTTGAA GCATTAGGTT TCCCAAAAAA AGGGAGCGAT
GGGGAGTGGG CTACTGGAAA ATTGATGGGA CCAATAGGCC GAGGTCTTAT CCAACTTGAG
GGTACTAAAC AGACAGGGCT TCGCTTGGAG TCAGGGTTTA GTGGTACTGC TATCTGGGAC
AAAAATCTTC AGGGTGTTGT GGGTATGGCG GTAAAAGCAG ACCAGGAACG CCCTGAGGCT
AAAGTGGCTT TTATGATTCC CACGGACCTA ATACTTCAAG TTGGGGATTT GGCCACGGTT
TGTCGCGTTG ATGGTAGGAT ACAAGGAGCG ATCGCTATTT TGGAAAATTA TTTTGAAGAT
TATACCACAG AAATACGCTA TGCTTACAAC CTATCTTTGC CAGAAATTTC TATTCCTTTA
AGTTCTGGGA AATCTCTAGA TGAGTACCCT GGATCTTTGA ATGAGATGAT TCAAAATTTG
GATGACCGGA CAAAGGAAAA TTATTCTTTG CTGGAGAGAT TTATCTGTTT TTTATTACTG
CATCTTGAGG ATTTAAAAAA ACCTTCTGAG CTTTGCCAGA AATTAACAGA ATGGTTGGAA
AAATATTCGC AAAATATTGA AGATTTGAAG GCTGTTTTAA GAAAAGAGAA AGCTCTGAAA
AATCAGGAAA ATTTTCAGAT TAAACAACCA GAACCTTATC TATTAGTAGC TGCTATTGAA
AAGAGTAAGG GTTTTATTTT GAAGGCATGG CTAATTGAAA ATCCTCAAAA TTATACTCCG
GAAAATCCTC AAGGTTTCCA TTCTTTTATT GATGAAGAAA ATGTTCTTAT GAATGCAAAA
GGAACTATAG TTAGTAATAA AAATACTTCG GAGTTTAACC AAGCTAAAAA TTTGACTGAA
TTATTACAGT TCTTTTGGGC TGATGTTTCG GAACGTTATG ATTTTAATCT AGAGAAAATT
GCAATATTTT TGCCTTATAA ATTAATTGAC CGGGATATTA AGCCGGTAGA TCAATATATA
AGTGATCCGA ATATACCTGA GTATTTTCAA ACTTTATTGG GAGAACAGTG TGAAATCACT
CTCAGGTTTT CAGAACGCCT TAGGCTGTCT GGAAATTCTA ATGCTGAATT AAATAAATTT
AACCAAAAGT GGCGATCGCT AACTAGCAGA CAAAGTGCAA GAGTAATTGA TATTTTTCAT
CCTTCGGCTA CTAGTGGTAA TAGGAAGAAG TTTTTTCGAC AAATTTTTGC TGACGATGTG
GCGGCAGTAA GATTAACAGA AGTGCTGCAA CCAGAGAAAC GAGAATCAGT TATGGAAGCT
TTTTACTATG CTGGAATTCC TGTGGCGCTA TGGATGAGGC CAGAAGCAGA AAATATTGAC
TGTGCTGAGG AACTGCAAAA TATCTGTAAC GCTTGTAACT CTTTGTCAAA TTTACCTAAA
GCTATAAAAG CAAAACGTTC TGAAGCTTGG GAACAAGATA TTGACAGACA TATTGGTAAT
CACTTATCAT TACTTTGGGA GGATCCTGAT ATTGTTCCAC CTGTTAATGA ACTGAGGATG
TTGGAGTCAT GA
 
Protein sequence
MVLELSIVRI FKSGGGVAGS GFLVSNEYIL TCAHVVAYCL DTPKKTAHII MRQKEIPDEI 
IEVNFPIFEK GKIGEKLETK VTFWRPLNDQ ENIQDIAVLK LINSDLLPED AKPINLIQIG
NQSLKENEFE ALGFPKKGSD GEWATGKLMG PIGRGLIQLE GTKQTGLRLE SGFSGTAIWD
KNLQGVVGMA VKADQERPEA KVAFMIPTDL ILQVGDLATV CRVDGRIQGA IAILENYFED
YTTEIRYAYN LSLPEISIPL SSGKSLDEYP GSLNEMIQNL DDRTKENYSL LERFICFLLL
HLEDLKKPSE LCQKLTEWLE KYSQNIEDLK AVLRKEKALK NQENFQIKQP EPYLLVAAIE
KSKGFILKAW LIENPQNYTP ENPQGFHSFI DEENVLMNAK GTIVSNKNTS EFNQAKNLTE
LLQFFWADVS ERYDFNLEKI AIFLPYKLID RDIKPVDQYI SDPNIPEYFQ TLLGEQCEIT
LRFSERLRLS GNSNAELNKF NQKWRSLTSR QSARVIDIFH PSATSGNRKK FFRQIFADDV
AAVRLTEVLQ PEKRESVMEA FYYAGIPVAL WMRPEAENID CAEELQNICN ACNSLSNLPK
AIKAKRSEAW EQDIDRHIGN HLSLLWEDPD IVPPVNELRM LES