Gene Tery_2708 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_2708 
Symbol 
ID4244971 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp4192357 
End bp4195296 
Gene Length2940 bp 
Protein Length979 aa 
Translation table11 
GC content32% 
IMG OID638107771 
Producttetratricopeptide TPR_2 
Protein accessionYP_722370 
Protein GI113476309 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG4421] Capsular polysaccharide biosynthesis protein 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.169437 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAATAG AAAAATATCT CAGATTAGCA AAATCATATT TGACTAAAGG CAATTTAAGT 
CAGGCAATAG AAATATGTGA ACAAATATTA GAAATACAAC CAAACTCCGC TCATGCTTAC
AGAATATTAG GGGAAATTTA CCAAGCAGAA GAGAATTTTG AAAAAGCTAT GTATGCTTAT
ACAAAAGCTG TGGAGATTCA ACCAAAATAT GCAGAAGTTC ATGCTTTTCT CGCTTGGTTA
TACAGTCAGA AAAAATGGCT GAGTGAAGCA GCTAATCAAT ATCAAAAAGC GATTAATTTA
GGACTAAAAT GGCCAGAATT ATATTATAAT TTAGGTAATA TATTCTATCA AGTTCGATAT
TTTGAATCAG CAATTCAATG CTATGAAAAT GCAATTGTTA TCAAACCGAA TTACATTAAT
GCTTATTTGG GTTTAGGCAT TATTTTTGAA AGGGAAAGAA ATTATCAAGC GGCAGTTGAT
ATTTATAGAA AAGTCATAGA ATTAAACCCA GATTTTGTAG AAGGATATAA TAATTTAGGT
CGTATTTTAG CTAATTGGAA TCGAAGGTCA GAAGCTATTG AGGTTTATCA ACAAGCTTTA
GTTTTAAAGC CTGATTCAGC AAGTTTATAT AATAATCTTG GTTTTACTTT ATTAAATGAA
AATCCTATAG AGGCGATCGC TGCTTATTAT CGTGCTATAG AATTAGACCC ATTATTGATC
AAAGCCCATT ATAATTTGGG TAAAGCATTA CAAATAATTG GGGAGCATGA ATTGGCAGTA
AAATCTTTTC AAGAAGTTAT TAGATTGAAG GGAAAAAAAG ATAATTTATT GGTTTATAGT
GATTGTGCTT TTTCTCTAAT GACTATAGGA AAGTTTCAAA CTGCTTTTGT TTACTTAAAA
AATGTAATTA CTAATAATAA GTTTGTTGAT GGTTGTTGGC AGGTATTAGA GTCGAAGTTA
GGTAATTTGG AGCATATAGA ATATGATAAA ATGTATTTAA CAAAAGTAGC TGTGTTGAAT
TTTATTAAAG AGCTAAAAAA GTTAGATATT AATTCAGAAA ATTATCAGCA ATCTTCTCAA
AAAATATCTC GGTATTTGCT GAGTTATTAT ATTAATTTTG GCAATGTATT GATGGAAAAT
GAGTTTTATA AAAGTGCTGA AAATATTTAT CATAAGGCGT TAAAAATTAA ACCAAATTCG
GCAGATATTT ATTGGTTATT AGGGAATAAT TTGGCTAAAC AAAAACGGTT AAATTCAGCA
ATTATTAGTT ACCAAATAGC TCTGCAAATT TTACTTAATG AAAATAGAAT TAATCAGACG
ACAAAGATTT ATTTTGAGTT GGGTAAGATT ATGGAAAAAC AGCAAAGATG GCAGGCAGCA
AGAGATTATT ATAGTAAAGT ATTATCCGGA AAAGTAGAAG ATATATTAAA TAATGATTTT
AATAATGATT TTTTGCCTTA TCCAAAGTTA GAGGGATTAT ATTTATCTAC AAAAGATTGG
TTGCAGAAAT TAGATGGTAA TAATTATCAG AAACGATATA AAGAAGTAAG ATTAGAAAAA
TTAGCAACAG AATATAGAGA AGAATCTATA AAATCTCCTA TTAAAGTAAG TGTTGAAAAT
AAGAAGAAAG ATTTAGAATG TCTTGGGTTA AATTGTAGTG CATGTTTAAA GCAAATTTAT
CAGTGGTTTG ACCCAAAGTC TCCTGCTCAA GGGATATATA GTCTTAGGGA TCAGGGTGAG
TTTAGTCAAG ATATTGATTT GCTGAGTCAG GAAATATCAC TTTTAAACCC CCTTGAAAAA
GAGGGGAAAA AATCTCTAAC TCCTGGGAAA AATGAGAGGA TGAGTCCTTC TTCCCTAACA
TCTGTAAGTA AGGAGGAAAG TATACTTAGT TTGACAAAAG ATACAAAAAA TGATAAAGAA
GGGCTTTCTT ACTTTCAGAA ACTTTCCCGG AGTCCTACCT TCGTGACTAC AGTACCGGAG
GGAAGAGCTT GGATAATGCC CAAGAAAAAT TACTGGAGAC TATGTTACGG CATTGCTATT
ATGACTCCAG ATAACTATTT ATTGTCAGAC TTGTCCAGAG AATATCCATC TCCATTGCCA
GGATGTACAA AGCATGACCC AAGTCAACAT CGGATTTTTG GTTTTGAAGA ACTGCCCCCC
CTAGAGAAAA TTAATGGTAA GGTAGCAGTA TTATCGGTGT TATCAGGAAA TGTCTATTTT
CATTGGATGG TAGACCTACT GCCAAGAATA GAGATATTGC GTCAGGGTAT TAATTTAGAA
GAAATAGATT GGTTTATAGT TAATGACTAT CAACAACCTT TTCAACGAGA GACATTAAAA
ACTCTTGGTG TCAAACAAGA GAAAATTTTG GCAAGCGATC GCCATCCTCA TGTTCAAGCA
ACAGAATTAG TCGTCCCTTC ATATCCTAGT TATCTGGGAT GGTTACAACC CTGGGGATTA
AAATTTTTAA GAGAAGTATT TCTTAGAGGT ATAACTAATA ATAAGTCATA CTTTCCGGAA
AAAATATATA TAGGTAGAGG TAATGCTAAG TATCGTCGAG TAATGAATGA AGCAGAAGTA
GTAGATATAT TACGCCAGTT TGGTTTTACT TATGTTACTC CAGAGTCAAT ATCCCTAGAA
AATCAGATTA GCACTTTTGC CCATGCCAAA ATCATAGTGG CCCCTCATGG AAGTGGTTTA
ACAAATATAG TATTCTGTAA CCCAGGGACA AAAGTTATCG AACTTTTCTC ACCCCACTAT
CTCAGATACT ATTATTGGCA TATTAGCCAA TTATTAGGAC TTGAGCATTA CTACTTAATT
GGTGAAACAT TTAGTTGTTA TCCAATGAGA AATATAATGT ATGAAAGTTC TTTAGTGGAA
GATATTTTAG TGAATTTAGA TTCATTAAAT CAGATGCTAA AAGTTGTAGA TATTTTTTAA
 
Protein sequence
MKIEKYLRLA KSYLTKGNLS QAIEICEQIL EIQPNSAHAY RILGEIYQAE ENFEKAMYAY 
TKAVEIQPKY AEVHAFLAWL YSQKKWLSEA ANQYQKAINL GLKWPELYYN LGNIFYQVRY
FESAIQCYEN AIVIKPNYIN AYLGLGIIFE RERNYQAAVD IYRKVIELNP DFVEGYNNLG
RILANWNRRS EAIEVYQQAL VLKPDSASLY NNLGFTLLNE NPIEAIAAYY RAIELDPLLI
KAHYNLGKAL QIIGEHELAV KSFQEVIRLK GKKDNLLVYS DCAFSLMTIG KFQTAFVYLK
NVITNNKFVD GCWQVLESKL GNLEHIEYDK MYLTKVAVLN FIKELKKLDI NSENYQQSSQ
KISRYLLSYY INFGNVLMEN EFYKSAENIY HKALKIKPNS ADIYWLLGNN LAKQKRLNSA
IISYQIALQI LLNENRINQT TKIYFELGKI MEKQQRWQAA RDYYSKVLSG KVEDILNNDF
NNDFLPYPKL EGLYLSTKDW LQKLDGNNYQ KRYKEVRLEK LATEYREESI KSPIKVSVEN
KKKDLECLGL NCSACLKQIY QWFDPKSPAQ GIYSLRDQGE FSQDIDLLSQ EISLLNPLEK
EGKKSLTPGK NERMSPSSLT SVSKEESILS LTKDTKNDKE GLSYFQKLSR SPTFVTTVPE
GRAWIMPKKN YWRLCYGIAI MTPDNYLLSD LSREYPSPLP GCTKHDPSQH RIFGFEELPP
LEKINGKVAV LSVLSGNVYF HWMVDLLPRI EILRQGINLE EIDWFIVNDY QQPFQRETLK
TLGVKQEKIL ASDRHPHVQA TELVVPSYPS YLGWLQPWGL KFLREVFLRG ITNNKSYFPE
KIYIGRGNAK YRRVMNEAEV VDILRQFGFT YVTPESISLE NQISTFAHAK IIVAPHGSGL
TNIVFCNPGT KVIELFSPHY LRYYYWHISQ LLGLEHYYLI GETFSCYPMR NIMYESSLVE
DILVNLDSLN QMLKVVDIF