Gene Information |
Locus tag | Tery_2167 |
Symbol | |
ID | 4242619 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 3382538 |
End bp | 3385582 |
Gene Length | 3045 bp |
Protein Length | 1014 aa |
Translation table | 11 |
GC content | 34% |
IMG OID | 638107273 |
Product | tetratricopeptide TPR_2 |
Protein accession | YP_721873 |
Protein GI | 113475812 |
COG category | [O] Posttranslational modification, protein turnover, chaperones |
COG ID | [COG3914] Predicted O-linked N-acetylglucosamine transferase, SPINDLY family |
TIGRFAM ID | |
|
|
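The coordinates and lengths reported above are mutually consistent, which a short check can confirm. This is a minimal sketch, assuming the start and end positions are 1-based and inclusive (as the reported gene length implies) and that the protein length excludes the stop codon. The function names are illustrative and not part of the source database.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Number of amino acids for a CDS, assuming one terminal stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1  # subtract the stop codon


# Values taken from the Gene Information table for Tery_2167.
length = gene_length(3382538, 3385582)
print(length)                  # 3045
print(protein_length(length))  # 1014
```

Both results match the Gene Length (3045 bp) and Protein Length (1014 aa) rows.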
Plasmid Coverage information |
Num covering plasmid clones | 7 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 8 |
Fosmid unclonability p-value | 0.796808 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGGACTACC AAAAATTCAT CCAACAACTT CCCACTAATT ACCAAAACTG GGGCAATAAC TCCCTTCAAC CATCTGACCA ATGGCCAAAA ACCCTCACCC CTAAACCCAA CGTCAACACC CTCAACCTCA TGAAACTACT CAACAGCGCT GTAGAACACA CAGAACCAGA TGAGATTTAC TGCGAGATAG GCACTACTCA AGGTCTCACT TTAATCGGAG CATTATTTGA GCATACCGAA AAAATGGCCT ATGCAGTTAA CAACTTTTCT AACTTTGATT CTACTGGAGA ACTACAACAA GAACTATTAG AAAATTTGCA GCAATTTAAT CTTGAATCAC AGGTCTTTTT TTGTGACCAA GATCTAGAAG AATTTTTATT AGAATTACGA GAAATAGAAA CAGAAAATAA TATTGGTGTC TATTTCTATA ATGGCTCTCC TGAATATCGC TCCGTACTGT TAGGATTAAT GTTAATTAAG CCTTTTTTAT CTAAAAAAGC ATTAGTGGTT ATTAATAATT CTAATAAAAG TCTAGTACAG CAAGCTTATT GGGATTTAAT GGCTAGTTTT CCTCAATATA AAATATTATC AGAATTACCA GAATTTGCAC AAATAAATCA CTTATTTGGT AATGGGATTC AACTCCTAAG TTTTGACAGT GAAAGAAATC ATAATTATCC ACCTTCTAGG ATTATTAATA ATCGTCAACA AAATGTCATT ACAGCCATTT CTAATCTACA ACAATGGGAG ACAGAATCAT CCAGGCAGTT GTTCTATGAA GAAGCAATAT ATTTACACCA ACAACAGCAA TGGCAATTAG CAGAAAAAAA ATATCAAGAG TTATTATTAT GGCAACCTAA TAATTCCCTG GTTTGGTTAC AATTAGGAGT GCTTTATTAT CAAATAGAAA ATTATCAACA GTCTATTTCA GCTATATCTA AATCTTTAGA AATAGAACCA TCAAGTTATG GATATTACTA CCAGGGTTTA GGGTTAGAAA AAATCAATCA AATTGAACAA GCGATCGCCT CCCACCAACA AGCAATTAAA CTAGATATTA ATTTCATAGA TCCTTATAAT AATTTAGGCA ATCTCCTCAA GACAAAAAGT GAATTTGAAC AAGCAGAAAC TATTTATCGC CAGAGCATTT CTATTAACCC CAACCATTGG GGAAGTTATA TTAATCTAGG TAATTTAATG CTCGAACAAA ATCGAGTTGA GTTGGCGATC GCTAATTATG AAAAGGCCCT AGAAATCAAT CCCGAAAATT CAGATATAGC TAACAATCTC AACATCGCTC TTCGAGCAAA AAATAACCCT GCCCCAATAT TATTAAATTG GGCACGAAGA CTGCAACAAT TAGGAAAATA TGAAACAGCA ATTTTAGAAT ATCGTAAATA TCTAGAACTT CAACCAACAG ATACTCAAGT CTATTCTTTT CTCTCAGAAT GTTATCAACA ATTAGACCAA AAACAAGCAA CTATAAACAT TCTCCAAGAG GGTACCCAAA CCTGCCCCAA TTCAGGACAG CTCCATTTTA ACTTAATTAT GACCCTATTA CAAAATGGGA GAACTGAACA AGCAATATCT CAAGCAGAAA TAGCTTTACA AAACCTGCCC CAAGATTACA CATTTAAACT CCTAAAACAT TTAATAGTTC CCATTATTTA CCACAGTCCA GAATCAATCA GCTTTTATCG CCAAAGATTT GAAACCGAAA TACAAAACCT CATAAAAACA ACCAATTTAG AAAATACTGA ATCGCGAAAA AACGCTCTTT TAGGTACTAG CAGATTCACT 
AATTTTTACT TAGCTTATCA AGCTCATAAT GTTCAAAAAT CTCAAATTAT CTATGGTAAT TTTCTCCATA AAATTCTAGG AGCTAATTAT CCAGAATACA TACAACCATT ATCCATTCCA CCCGTAGAAA ATAAAATCCG CATTGGCTAT ATTTCCAACT ATTTACATTC TTATAGTGGC AGCTTGTGGT TAATAGGTTG GTTACGTTAT GCTGACCATA AAAATTTTGA AATTTACTGT TATTATACTG GTAATTCTCC TGACCCAATA ACAGAAAAAT TTCGTCAATA TAGCCATAAA TTTCATCATA TTCCAGGCAA TTTACCCGCA GTTTGCCAAC AAATATTAAA TGATAAACTA CATATATTAG TCTATCCCGA AATAGGTATG GACCCACCCA CAATGCAAAT AGCAGCCTTG CGGTTAGCAC CTATACAATG TACTGCCTGG GGCCATCCAG TAACTACAGG GTTACCAACT ATTGATTATT TTCTATCCAG TGAATTAATG GAGCCAGAAA ATGCTCAAGA ACATTACTCA GAAACTTTAA TTAAACTACC TAATATTGGG GTGGCTTATC CTGAACCTAA AGACATTACT AACCTGACAA AAAAACAGAC TATTCCCCCT TTTCCAAAGA GCGTTAGGGG CGAAAAAAAA TTCCAATTAC CTGAAGATGG AGTTATTTAT TTATGCTGTC AAGCACCTTT TAAATATTTG CCTCAATATG ATTATATTTT CCCAGAAATA GCTTTAGGAG TTCCCCAAGC AAAATTTGTT TTTCTCAGAG GAACTTTATT AAAACCTCGT TTAGAAAAAG CTTTTTATTC TAGAGGGTTA AATAGTGAAG ATTATTGTGT TCATTTAAAG ATTCCAGAAA GGTCAGATTA TTTGATGTTA AATCTACTTT CAGATGTTTT TCTAGATACT TTTACTTGGT CTGGTGGGAA TACTTCTTTA GAGGCGATCG CTTGTAATTT ACCGATAGTA ACTTGTCCTG GAGAATTTAT GCGCAGTCGT CATGCAGATA GTTTCTTAAA AATGTTGGGA GTTATAGATA CTATTGCTAA AGATGAAGCA GAATATATTC ATATTGCTGT TAAATTAGGT TTAGAAACTG CTTGGCGAAA TGACATTTCT CAGAGGATGA GTTTACGCCA TAATTATCTA TTTGATGACC AAACTTGTGT CAATGCCTTA GAAGATTTTT ATCAACAAAT AGTCCGAAAA TATTCCTCGT GGTGA
|
Protein sequence | MDYQKFIQQL PTNYQNWGNN SLQPSDQWPK TLTPKPNVNT LNLMKLLNSA VEHTEPDEIY CEIGTTQGLT LIGALFEHTE KMAYAVNNFS NFDSTGELQQ ELLENLQQFN LESQVFFCDQ DLEEFLLELR EIETENNIGV YFYNGSPEYR SVLLGLMLIK PFLSKKALVV INNSNKSLVQ QAYWDLMASF PQYKILSELP EFAQINHLFG NGIQLLSFDS ERNHNYPPSR IINNRQQNVI TAISNLQQWE TESSRQLFYE EAIYLHQQQQ WQLAEKKYQE LLLWQPNNSL VWLQLGVLYY QIENYQQSIS AISKSLEIEP SSYGYYYQGL GLEKINQIEQ AIASHQQAIK LDINFIDPYN NLGNLLKTKS EFEQAETIYR QSISINPNHW GSYINLGNLM LEQNRVELAI ANYEKALEIN PENSDIANNL NIALRAKNNP APILLNWARR LQQLGKYETA ILEYRKYLEL QPTDTQVYSF LSECYQQLDQ KQATINILQE GTQTCPNSGQ LHFNLIMTLL QNGRTEQAIS QAEIALQNLP QDYTFKLLKH LIVPIIYHSP ESISFYRQRF ETEIQNLIKT TNLENTESRK NALLGTSRFT NFYLAYQAHN VQKSQIIYGN FLHKILGANY PEYIQPLSIP PVENKIRIGY ISNYLHSYSG SLWLIGWLRY ADHKNFEIYC YYTGNSPDPI TEKFRQYSHK FHHIPGNLPA VCQQILNDKL HILVYPEIGM DPPTMQIAAL RLAPIQCTAW GHPVTTGLPT IDYFLSSELM EPENAQEHYS ETLIKLPNIG VAYPEPKDIT NLTKKQTIPP FPKSVRGEKK FQLPEDGVIY LCCQAPFKYL PQYDYIFPEI ALGVPQAKFV FLRGTLLKPR LEKAFYSRGL NSEDYCVHLK IPERSDYLML NLLSDVFLDT FTWSGGNTSL EAIACNLPIV TCPGEFMRSR HADSFLKMLG VIDTIAKDEA EYIHIAVKLG LETAWRNDIS QRMSLRHNYL FDDQTCVNAL EDFYQQIVRK YSSW
|
| |
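The relationship between the gene sequence and the protein sequence can be illustrated by translating the first 30 nucleotides. This is a sketch under stated assumptions: the displayed gene sequence is taken to be in coding orientation already (it begins with ATG and ends with TGA, even though the gene lies on the minus strand), and the amino acid assignments are those of NCBI translation table 11, which are the same as in the standard code. The helper names `clean`, `translate`, and `gc_percent` are hypothetical, and alternative start codons are not handled.

```python
from itertools import product

# Amino acid assignments for NCBI translation table 11, with codons
# enumerated in the conventional T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces that group the sequence into blocks of 10."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding-strand DNA string, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - len(dna) % 3, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in the sequence."""
    dna = clean(seq)
    return 100.0 * sum(base in "GC" for base in dna) / len(dna)


# First three 10-base blocks of the gene sequence shown above.
fragment = "ATGGACTACC AAAAATTCAT CCAACAACTT"
print(translate(fragment))  # MDYQKFIQQL
```

The output matches the first ten residues of the protein sequence. Applying `gc_percent` to the full gene sequence would be the way to compare against the 34% GC content reported in the table; the value for a short fragment need not agree with it.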