Gene Tery_2809 details


Gene Information

Locus tag: Tery_2809
Symbol:
ID: 4245343
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 4358134
End bp: 4361091
Gene Length: 2958 bp
Protein Length: 985 aa
Translation table: 11
GC content: 39%
IMG OID: 638107861
Product: tetratricopeptide TPR_2
Protein accession: YP_722458
Protein GI: 113476397
COG category: [G] Carbohydrate transport and metabolism
COG ID: [COG2956] Predicted N-acetylglucosaminyl transferase
TIGRFAM ID:
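
The coordinates and lengths listed above can be cross-checked with a little arithmetic. The sketch below is illustrative only: it assumes the start and end positions are 1-based and inclusive, and that the coding sequence ends with a single stop codon that is not counted in the protein length. The function names are hypothetical and do not come from the source database.

```python
# Values copied from the Gene Information fields above.
START_BP = 4358134
END_BP = 4361091
GENE_LENGTH_BP = 2958
PROTEIN_LENGTH_AA = 985


def span_length(start: int, end: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end - start + 1


def coding_residues(gene_length_bp: int) -> int:
    """Residue count, assuming the final codon is a stop codon."""
    if gene_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_length_bp // 3 - 1


print(span_length(START_BP, END_BP))    # 2958
print(coding_residues(GENE_LENGTH_BP))  # 985
```

Under these assumptions both results agree with the Gene Length and Protein Length fields of the record.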


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.00683217
Plasmid hitchhiking: No
Plasmid clonability: unclonable
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGATAAAA AATATCAAAT ACTTGAAAAA ATACAGGGAT TATCTATTTA TCAATTAAGT 
AATACTTACT ATGAAAAATC TGGTAAATTA ATCACTGTTA AACCTCAAGA AAAGAAAAAT
ATTTTTCTGA AAAATAGTAT AGAAATCATC AAAAAAATTA CTAGGTTGAT GAAAATTAAC
TCTCCCCAAA ATAAACTAAA TTTATTAGAA AATAACTACT CGCCTCAACC AGAAAAAAAA
TTAACGGTGA GAGAAAAAGC TAACCTCCGC CTCGCCTATT ATTGGTTAAA AATATATCAA
CCAGAACCAG GTGCATCTAA CTTGGAAAAA GTCAAAGGAT ATCTGCAAGC CTTCCGTCAT
TTTGCCAAAA TGGGAGAGTG GGAAAAAGCT AGTCAAATTC TTTCTCTTAA ATTACCGCTA
GTTGATGAAA ATTTAGACAC TCAACTAAAT AGATGGGGTT ACTATGAGGA ATGTATGGAA
CTCTACAATA AATTATTAGG TAAATTAGAC CAAAGCTGGG ATGGTATTTG TTTAAATGGT
TTGGGAAATG CTTACTTATC TCTCGGAAAA TATCACAAAG CTATTGAGCA TTATCAGCAG
CACTTACAAA TAGCTAAGGA AATAGGTGAT CTTGGGGGAC AGGGTATTGC TTTAGGGAAT
TTGGGAGATG CTTACCATTC TCTAGAAGAA TATAACAAAG CTATTGAGTA TCATCAGCAG
CATTTACAAA TAGCGAAGGA AACAGGGGAA CTCGGGGGAG AAGGTATTGC TTTAGGGAAT
TTGGGAAGTG CTTACTATTC CCTGGGAAAA TATCACAAAG CTATTGAGTG TCATCAGCAG
CATTTACAAA TAACAAAGGA AATAGGGAAT ATGAAACAGC AGGGCATTGC TTTAGGCAAT
TTAGGGAATG CTTACCATTC CCTAGGAGCA TATCACAAAG CTATGGAATA TCATCAACAG
GACTTAGAAA TAGTTAGGAA AATAGGCGAT CTTGGGGGTG AGGGCATTGC TTTAGGGAAT
TTGGGAAGTG CTTACTATTC CTTGGGAGAA TATCACAAAG CTATTGAGTC TCATCAGCAG
CATTTACAAA TAGCTAGAAA AATAGGCAAT GTTAAACAGG AGGGTATTGC TTTGGGGAAT
TTGGGTAATG CTTACCATTC CCTAGGAGCA TATCACAAAG CCATGGAATA TCATCAACAG
GACTTAGAAA TAGTTAGGAA AATAGGGGAT CCTGGGGGTG AGGGTAATGC TTTGGGAAAT
TTGGGTAATG CTTACTATTT TTTGGGGGAA TATCACAAAG CTATTGAACA TCATCAGCAG
CATTTACAGA TAGCCAGAGA AATAGGTAGC CAGAAAGGAG AGGGTAATGC TTTGGGAAAT
TTGGGTAATG CTTACTATTC TTTGGGGGAA TATCACAAAG CTATTGAGCA TCATCAGCAG
CATTTACAAA TAGCCAGAGA AATAGGCAAC CAGAATAGAG AGGGTAATGC TTTGGGGAAT
TTGGGTAATG CTTACTATTC TTTGGGAGAA TATCACAAAG CTATTGAGCA TCATCAGCAG
CATTTACAAA TAGCCAAAGA AATAGGCGAT CGCTCTGGAG AAGGAAATGC TTTGGAGAAT
TTGGGTAATG CTTACTATTC CAGCAGAAAA TACGACAAAG CTATTGAACA TCATCAGCAG
TATTTACAAA TAATTAGAGA AATAGGCGAT CGCTCTGGAG AAGGAAATGC TTTGAGGAAT
TTGGGTAATG CTTACTATTC TAGCAGAAAA TACGACAAAG CTATTGAAGA TCATCAGCAA
TATTTACAAA TAGCCCAAGA AATTGGCGAT CGCTCTGGAG AAGGAAATGC TTTGGAGAAT
TTGGGAAATG CTTACTATTC CAGCAGAAAA TACGACAAAG CTATTGAACA TCATCAGCAG
TATTTACAAA TAGCCCAAGA AATTGGCGAT CGCTCTGGAG AAGGAAATGC TTTGGGGAAT
TTGGGTAATG CTTACTGTTC CCTAGGAGAA TATTACAAGG CTATTGAACA TCATCAGCAG
CATTTACAAA TAGCCCAAGA AGTTGGCGAT CGCTCTGGAG AAGGAAATGC TTTGGGGAAT
TTGGGTAATG TTTACTATTC TCTAGGAGAA TATCACAAAG CTATTGAACA TCATCAGCAG
CATTTACAAA TAGCCCAAGA AATTGGCGAT CGCTCTGGAG AAGGAAATGC TTTGGGAAAT
TTGGGAAATG CTTACTGTTC CCTAGGAGAA TATCACAAAG CTATTGAGCA TCATCAGCAG
CATTTACAAA TAGCCCAAGA AATAGGCGAT CGCTATGAGG AGGGTAGTGC TTTGGGGAAT
TTGGGTAATA CTTATGATTT TTTGGGAGAA TACGACAAAG CTATGGAGTA TCATCAGCAG
CATTTACAAA TAGCTAGGGA AATTGGCGAT CGCTCTGGAG AGGGTAATGC TTTGGGGAAT
TTGGGAAATG CTTACTATTC CCTGGAAGCA TATCACGAAG CTATTGACCA TCATCAGCGG
CATTTACAAA TAGCAAAGGA AACAGGCAAC TTTAGAGGGG AGGGTGTTGC TTTGGGGAAT
TTAGGAAATG TTTACTTATC CCTAGGAGAA TATTACAAGG CGATCGAGTC TTATCAGCAG
AGCTTAGAAA TAGCTAGAGA AATAGGTAAC CGTTCTGGGG AAGGTGGTGC TTTGGGAAAC
TGGGGAAAAA CTCTCCTCAA ATTAGAAAAG TATGCCGACT CTTTGGAATA TTCACAAGCA
GCATTAGAGA TATTTGAGCA GATAGGAAAC CCTCATCACC AAGCAATAGT TCTGAAGAAT
ATAGCAGAAA CCTATCAAAA TTTGGGGTTT TGGGATATGG CACGGCGGCA TTGCGAGGAG
GCGTTGGCTA TTTTCACAGA ATTAGGAGTG CCAGAGTTGA GAGAATGTCA GGAGTTGTTT
GAGGTTCTTG AAAAATAA
 
Protein sequence
MDKKYQILEK IQGLSIYQLS NTYYEKSGKL ITVKPQEKKN IFLKNSIEII KKITRLMKIN 
SPQNKLNLLE NNYSPQPEKK LTVREKANLR LAYYWLKIYQ PEPGASNLEK VKGYLQAFRH
FAKMGEWEKA SQILSLKLPL VDENLDTQLN RWGYYEECME LYNKLLGKLD QSWDGICLNG
LGNAYLSLGK YHKAIEHYQQ HLQIAKEIGD LGGQGIALGN LGDAYHSLEE YNKAIEYHQQ
HLQIAKETGE LGGEGIALGN LGSAYYSLGK YHKAIECHQQ HLQITKEIGN MKQQGIALGN
LGNAYHSLGA YHKAMEYHQQ DLEIVRKIGD LGGEGIALGN LGSAYYSLGE YHKAIESHQQ
HLQIARKIGN VKQEGIALGN LGNAYHSLGA YHKAMEYHQQ DLEIVRKIGD PGGEGNALGN
LGNAYYFLGE YHKAIEHHQQ HLQIAREIGS QKGEGNALGN LGNAYYSLGE YHKAIEHHQQ
HLQIAREIGN QNREGNALGN LGNAYYSLGE YHKAIEHHQQ HLQIAKEIGD RSGEGNALEN
LGNAYYSSRK YDKAIEHHQQ YLQIIREIGD RSGEGNALRN LGNAYYSSRK YDKAIEDHQQ
YLQIAQEIGD RSGEGNALEN LGNAYYSSRK YDKAIEHHQQ YLQIAQEIGD RSGEGNALGN
LGNAYCSLGE YYKAIEHHQQ HLQIAQEVGD RSGEGNALGN LGNVYYSLGE YHKAIEHHQQ
HLQIAQEIGD RSGEGNALGN LGNAYCSLGE YHKAIEHHQQ HLQIAQEIGD RYEEGSALGN
LGNTYDFLGE YDKAMEYHQQ HLQIAREIGD RSGEGNALGN LGNAYYSLEA YHEAIDHHQR
HLQIAKETGN FRGEGVALGN LGNVYLSLGE YYKAIESYQQ SLEIAREIGN RSGEGGALGN
WGKTLLKLEK YADSLEYSQA ALEIFEQIGN PHHQAIVLKN IAETYQNLGF WDMARRHCEE
ALAIFTELGV PELRECQELF EVLEK
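
Both sequences above are printed in blocks of 10 characters separated by spaces. A minimal sketch for rejoining the blocks before further analysis follows; the helper name `join_blocks` is hypothetical, and the two sample lines are the first and last lines of the gene sequence as printed in this record.

```python
def join_blocks(text: str) -> str:
    """Remove all whitespace (spaces and newlines) from a blocked sequence."""
    return "".join(text.split())


# First and last lines of the gene sequence, copied from the record above.
first_line = "ATGGATAAAA AATATCAAAT ACTTGAAAAA ATACAGGGAT TATCTATTTA TCAATTAAGT "
last_line = "GAGGTTCTTG AAAAATAA"

head = join_blocks(first_line)
tail = join_blocks(last_line)

print(len(head))  # 60
print(head[:3])   # ATG
print(tail[-3:])  # TAA
```

Applied to the complete gene and protein sequences, the joined lengths can be compared with the 2958 bp and 985 aa stated in the Gene Information fields.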