Gene Information Plasmid Coverage information Fosmid Coverage information Sequence |
Gene Information |
Locus tag | Tery_2708 |
Symbol | |
ID | 4244971 |
Type | CDS |
Is gene spliced | No |
Is pseudo gene | No |
Organism name | Trichodesmium erythraeum IMS101 |
Kingdom | Bacteria |
Replicon accession | NC_008312 |
Strand | - |
Start bp | 4192357 |
End bp | 4195296 |
Gene Length | 2940 bp |
Protein Length | 979 aa |
Translation table | 11 |
GC content | 32% |
IMG OID | 638107771 |
Product | tetratricopeptide TPR_2 |
Protein accession | YP_722370 |
Protein GI | 113476309 |
COG category | [G] Carbohydrate transport and metabolism |
COG ID | [COG4421] Capsular polysaccharide biosynthesis protein |
TIGRFAM ID | |
| 

|
Plasmid Coverage information |
Num covering plasmid clones | 6 |
Plasmid unclonability p-value | 1 |
Plasmid hitchhiking | No |
Plasmid clonability | normal |
| |
Fosmid Coverage information |
Num covering fosmid clones | 6 |
Fosmid unclonability p-value | 0.169437 |
Fosmid Hitchhiker | No |
Fosmid clonability | normal |
| |
Sequence |
Gene sequence | ATGAAAATAG AAAAATATCT CAGATTAGCA AAATCATATT TGACTAAAGG CAATTTAAGT CAGGCAATAG AAATATGTGA ACAAATATTA GAAATACAAC CAAACTCCGC TCATGCTTAC AGAATATTAG GGGAAATTTA CCAAGCAGAA GAGAATTTTG AAAAAGCTAT GTATGCTTAT ACAAAAGCTG TGGAGATTCA ACCAAAATAT GCAGAAGTTC ATGCTTTTCT CGCTTGGTTA TACAGTCAGA AAAAATGGCT GAGTGAAGCA GCTAATCAAT ATCAAAAAGC GATTAATTTA GGACTAAAAT GGCCAGAATT ATATTATAAT TTAGGTAATA TATTCTATCA AGTTCGATAT TTTGAATCAG CAATTCAATG CTATGAAAAT GCAATTGTTA TCAAACCGAA TTACATTAAT GCTTATTTGG GTTTAGGCAT TATTTTTGAA AGGGAAAGAA ATTATCAAGC GGCAGTTGAT ATTTATAGAA AAGTCATAGA ATTAAACCCA GATTTTGTAG AAGGATATAA TAATTTAGGT CGTATTTTAG CTAATTGGAA TCGAAGGTCA GAAGCTATTG AGGTTTATCA ACAAGCTTTA GTTTTAAAGC CTGATTCAGC AAGTTTATAT AATAATCTTG GTTTTACTTT ATTAAATGAA AATCCTATAG AGGCGATCGC TGCTTATTAT CGTGCTATAG AATTAGACCC ATTATTGATC AAAGCCCATT ATAATTTGGG TAAAGCATTA CAAATAATTG GGGAGCATGA ATTGGCAGTA AAATCTTTTC AAGAAGTTAT TAGATTGAAG GGAAAAAAAG ATAATTTATT GGTTTATAGT GATTGTGCTT TTTCTCTAAT GACTATAGGA AAGTTTCAAA CTGCTTTTGT TTACTTAAAA AATGTAATTA CTAATAATAA GTTTGTTGAT GGTTGTTGGC AGGTATTAGA GTCGAAGTTA GGTAATTTGG AGCATATAGA ATATGATAAA ATGTATTTAA CAAAAGTAGC TGTGTTGAAT TTTATTAAAG AGCTAAAAAA GTTAGATATT AATTCAGAAA ATTATCAGCA ATCTTCTCAA AAAATATCTC GGTATTTGCT GAGTTATTAT ATTAATTTTG GCAATGTATT GATGGAAAAT GAGTTTTATA AAAGTGCTGA AAATATTTAT CATAAGGCGT TAAAAATTAA ACCAAATTCG GCAGATATTT ATTGGTTATT AGGGAATAAT TTGGCTAAAC AAAAACGGTT AAATTCAGCA ATTATTAGTT ACCAAATAGC TCTGCAAATT TTACTTAATG AAAATAGAAT TAATCAGACG ACAAAGATTT ATTTTGAGTT GGGTAAGATT ATGGAAAAAC AGCAAAGATG GCAGGCAGCA AGAGATTATT ATAGTAAAGT ATTATCCGGA AAAGTAGAAG ATATATTAAA TAATGATTTT AATAATGATT TTTTGCCTTA TCCAAAGTTA GAGGGATTAT ATTTATCTAC AAAAGATTGG TTGCAGAAAT TAGATGGTAA TAATTATCAG AAACGATATA AAGAAGTAAG ATTAGAAAAA TTAGCAACAG AATATAGAGA AGAATCTATA AAATCTCCTA TTAAAGTAAG TGTTGAAAAT AAGAAGAAAG ATTTAGAATG TCTTGGGTTA AATTGTAGTG CATGTTTAAA GCAAATTTAT CAGTGGTTTG ACCCAAAGTC TCCTGCTCAA GGGATATATA GTCTTAGGGA TCAGGGTGAG TTTAGTCAAG ATATTGATTT GCTGAGTCAG GAAATATCAC TTTTAAACCC CCTTGAAAAA GAGGGGAAAA AATCTCTAAC TCCTGGGAAA AATGAGAGGA TGAGTCCTTC TTCCCTAACA TCTGTAAGTA AGGAGGAAAG TATACTTAGT TTGACAAAAG ATACAAAAAA TGATAAAGAA GGGCTTTCTT ACTTTCAGAA ACTTTCCCGG AGTCCTACCT TCGTGACTAC AGTACCGGAG GGAAGAGCTT GGATAATGCC CAAGAAAAAT TACTGGAGAC TATGTTACGG CATTGCTATT ATGACTCCAG ATAACTATTT ATTGTCAGAC TTGTCCAGAG AATATCCATC TCCATTGCCA GGATGTACAA AGCATGACCC AAGTCAACAT CGGATTTTTG GTTTTGAAGA ACTGCCCCCC CTAGAGAAAA TTAATGGTAA GGTAGCAGTA TTATCGGTGT TATCAGGAAA TGTCTATTTT CATTGGATGG TAGACCTACT GCCAAGAATA GAGATATTGC GTCAGGGTAT TAATTTAGAA GAAATAGATT GGTTTATAGT TAATGACTAT CAACAACCTT TTCAACGAGA GACATTAAAA ACTCTTGGTG TCAAACAAGA GAAAATTTTG GCAAGCGATC GCCATCCTCA TGTTCAAGCA ACAGAATTAG TCGTCCCTTC ATATCCTAGT TATCTGGGAT GGTTACAACC CTGGGGATTA AAATTTTTAA GAGAAGTATT TCTTAGAGGT ATAACTAATA ATAAGTCATA CTTTCCGGAA AAAATATATA TAGGTAGAGG TAATGCTAAG TATCGTCGAG TAATGAATGA AGCAGAAGTA GTAGATATAT TACGCCAGTT TGGTTTTACT TATGTTACTC CAGAGTCAAT ATCCCTAGAA AATCAGATTA GCACTTTTGC CCATGCCAAA ATCATAGTGG CCCCTCATGG AAGTGGTTTA ACAAATATAG TATTCTGTAA CCCAGGGACA AAAGTTATCG AACTTTTCTC ACCCCACTAT CTCAGATACT ATTATTGGCA TATTAGCCAA TTATTAGGAC TTGAGCATTA CTACTTAATT GGTGAAACAT TTAGTTGTTA TCCAATGAGA AATATAATGT ATGAAAGTTC TTTAGTGGAA GATATTTTAG TGAATTTAGA TTCATTAAAT CAGATGCTAA AAGTTGTAGA TATTTTTTAA
|
Protein sequence | MKIEKYLRLA KSYLTKGNLS QAIEICEQIL EIQPNSAHAY RILGEIYQAE ENFEKAMYAY TKAVEIQPKY AEVHAFLAWL YSQKKWLSEA ANQYQKAINL GLKWPELYYN LGNIFYQVRY FESAIQCYEN AIVIKPNYIN AYLGLGIIFE RERNYQAAVD IYRKVIELNP DFVEGYNNLG RILANWNRRS EAIEVYQQAL VLKPDSASLY NNLGFTLLNE NPIEAIAAYY RAIELDPLLI KAHYNLGKAL QIIGEHELAV KSFQEVIRLK GKKDNLLVYS DCAFSLMTIG KFQTAFVYLK NVITNNKFVD GCWQVLESKL GNLEHIEYDK MYLTKVAVLN FIKELKKLDI NSENYQQSSQ KISRYLLSYY INFGNVLMEN EFYKSAENIY HKALKIKPNS ADIYWLLGNN LAKQKRLNSA IISYQIALQI LLNENRINQT TKIYFELGKI MEKQQRWQAA RDYYSKVLSG KVEDILNNDF NNDFLPYPKL EGLYLSTKDW LQKLDGNNYQ KRYKEVRLEK LATEYREESI KSPIKVSVEN KKKDLECLGL NCSACLKQIY QWFDPKSPAQ GIYSLRDQGE FSQDIDLLSQ EISLLNPLEK EGKKSLTPGK NERMSPSSLT SVSKEESILS LTKDTKNDKE GLSYFQKLSR SPTFVTTVPE GRAWIMPKKN YWRLCYGIAI MTPDNYLLSD LSREYPSPLP GCTKHDPSQH RIFGFEELPP LEKINGKVAV LSVLSGNVYF HWMVDLLPRI EILRQGINLE EIDWFIVNDY QQPFQRETLK TLGVKQEKIL ASDRHPHVQA TELVVPSYPS YLGWLQPWGL KFLREVFLRG ITNNKSYFPE KIYIGRGNAK YRRVMNEAEV VDILRQFGFT YVTPESISLE NQISTFAHAK IIVAPHGSGL TNIVFCNPGT KVIELFSPHY LRYYYWHISQ LLGLEHYYLI GETFSCYPMR NIMYESSLVE DILVNLDSLN QMLKVVDIF
|
| |