Gene Tery_0271 details


Gene Information

Locus tag: Tery_0271
Symbol:
ID: 4241638
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 417849
End bp: 420950
Gene Length: 3102 bp
Protein Length: 1033 aa
Translation table: 11
GC content: 44%
IMG OID: 638105612
Product: pentapeptide repeat-containing protein
Protein accession: YP_720227
Protein GI: 113474166
COG category: [S] Function unknown
COG ID: [COG1357] Uncharacterized low-complexity proteins
TIGRFAM ID:
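
The start, end, gene length, and protein length fields above can be cross-checked with simple arithmetic. The following is a minimal sketch, assuming 1-based inclusive coordinates and a CDS that ends in a single stop codon; neither convention is stated in the record, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length for 1-based, inclusive coordinates (assumed convention)."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for a CDS that ends in exactly one stop codon (assumed)."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1  # drop the stop codon


length_bp = gene_length(417849, 420950)
print(length_bp, protein_length(length_bp))  # 3102 1033
```

Under these assumptions the computed values agree with the reported 3102 bp and 1033 aa.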


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.977531
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0705059
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGCTGTTA AACTGTTGGA CGATTTTTCG AAATCTCTCG GGTTGGAAAC TGTTAACGGT 
GGGGCTAACC TTGTTAAGGC GCTTTTTAAA TTAGCTGAGA CACTAGAAAA ATATAAGGAT
ACTCAGGAAT TAAAGTCTGC TATTGAAAAG ATTAATATTG AATCTCTGTT GGATGCACTA
AATTCTCCAT TGGGAAGTGT TGTTAGAGAT AGTGTACCTT TCTTGCCTAT TGCAACTGGA
ATTATTTCTT ATATTGTTAA ACAAAGTCGT CAGGAGCCTA CTTTGGAAGA TGAGGTGCAG
TTGTTGGTTC AGGTGGCTTT TTTAGAAAGT TTTCGACAAT TTTTTGTAGA GCATCCAGAA
ATTAAGGAGC AGTTCACAGA CACTGAGGCT TCGGAGGATG TAAAAAAACA GATTAAGAAG
TTGGGTAAAG ATGGTTTTAA TCGCCAAGAT GCTAAGGATA CTTTGATTTG TTTTTATGAT
TCACCGTTGA GAGAAAAGTT TGATAATGTT GTATTGGCAC GCTTAAAAGA GTCTGGTTTG
GATGAAAAAA CGGCGAAAAT AGTGACTGAA AGAATTTCTC GCAGTACCCA TCGTTATATG
AAAGAAGCGG TATCAGAGGT GAAGGATGAT GCTAAAAAAT TAGCGGGAAT TTATGGGTAT
GGCTGGCAGC AAGATTTGGA GGTTTATGGG AGTGTTGATG GGTATTTGAA AAATAAAGTT
GCTCGTCTGC CAGAGGAAAA AGTATTTGAT GAAAGTTTCT CTTTTCAGGA TATTTATGTA
CCACTTGAGG TCGAGTCTGT GTCCGATGGA GAGGTTGATA AAAATGCTGA ATCTCAGAAT
ATTGAGAAGT GGGCAAAAAC AATTTTGTTA CCTCAACACT TTGATGAAAA TTTTGATGAA
AATTCGCGGA AAAATAATGA GGAAAAAGAT AAAAAAGATA AACAGGTTTT ATTTATTCAA
GGGGGGCCTG GAAGAGGAAA AAGTGTATTT TGCCGGATGT TTGCTGACTG GGTGCGGCGG
GAACTACATC CTATCTATAC CCCTATTTTG ATCCGGTTGC GAGATGTGAA AAATTTTGCA
GCAAATATTG ATCAGACTCT GGCTGATGCT GTGAATTACG ATTTTGTGAA AATTGATGGG
GGCTGGTTGA CAGACCGGAA TACTCGGTTT CTGTTTTTGC TGGATGGGTT TGATGAGTTG
TTGTTGGAGC GGGGGGCAAG CAGTGAGTTG AAGGTGTTTT TAGATCAGGT AGCACAGTTT
CAGAAACAGG CAGCGGAAAA TAAGGAGCGC GGTCATCGGG TTTTGATCAC GGGTAGACCT
TTGGCGCTTT ATGGTATTGA AAGGTTGATG CCTCCTAATT TGGAGCGGGT GAGTATTTTG
CCGATGGATG ATGATATCCA GGGACAGTGG TTTGAGAAGT GGCGGACGGT GGTGGGGGAT
GAGGAAACAG AAAAGTTTGG GGAGTTTTTA CGGAGTGAAC AATGTCCGGA GCAGGTTAAC
GAGTTGGCGC GGGAACCTCT TTTGTTGTAT TTGCTGGCTT CAATGCACAG GGGTGGGAAG
TTAAAGGTAG AGATGTTTGC TAATACTGAT GTTGGTGAGA CAAAGGTTTT AATTTATGAG
CAGGCGTTGG AGTGGGTACT GGAAAAACAG CGGGTAGAAG AGGGTAAAAA TATTAATCTT
GAAATTACGA AGTTGGAGCC AGAAGATCTG GAGGTTTTAT TGACAGAGGC GGGGTTGTGT
GTGGTGCAGT CTGGTGGGGA ATATGCTGCT ATAGAAATGA TCGAGGAGCG GCTGGTGAAA
AAGGGAGATA AGGAGTTAAA GCGGTTAATT GAAAATGCTA GGGAAGATAA GCGGGAGGAT
GGGTTGAAAA ATGCTTTGGC TGCTTTTTAT TTGAAGTCGG CAACGGGGGC AGAAAATTCG
GTGGAGTTTT TTCATAAGAG TTTTGGTGAG TTTCTCTGTG CAAAACGGAT GGCAGAAAGT
CTGGAATATT TTACGCAGAA AACTAAGAAA GGACGCAAGT TTAGTTACTC TGTTTCTGAT
GAGGAGTTGG TATGGCAGGT TTACGATCTG TTTGGTTATG GGGGTTTAAC CGTGGAAGTT
GTGGAGTATT TGATGGCTTT GTTGGTGAAA AGTGAAGCAA AGCTGGTGGT TTTGTTTGAG
CGCTTACATG AGTTTTATCT AGACTGGTGT GATGGGAAGT TTATTGAGGC GACGGAGGAA
ACTTTACCTC AGAAAAAGGC GAGGGAGTTG CAGCAGTGGC GTATTAAGTC TGGGCAACGC
CAAGTGGATA TTTATACTGG GTTGAATGTG ATTATTTTGT TGTTTGAGCT GGATCGTTAT
GGGCAGTCTC AGGAGGAGTT GCGGGAGCAG TTGCGTTTTT ATCCTTGCGG TCAACATGGT
AGTGAAAATT TTGAGGCTCT AAAACTGTTA ACAATTATTG GCTATAGTCA ATGTTTAGGA
GTTGGTGCGT TTTGGGAAAC AGTAGGTCAG TTTCTAAGTG GTGCCGACCT CAGATATGCC
GACCTCAGTG GTGCCTACCT CATAGTTGCC AACCTCAGAT ATGCCGACCT CAGTGGTGCC
TACCTCATAA GTGCCGACCT CAGTGGTGCC TACCTCATAG GTGCCAACCT CATAGGTGCC
GACCTCAGTC GTGCCGACCT CAGATATGCC GACCTCAGTG GTGCTAACCT CAGTGATGCC
AAACTCAGTG GTGCTAACCT CAGTGATGCC AAACTCAGTG GTGCCGGCCT CAGTGGTGCC
GACCTCAGAT ATGCCGACCT CAGTGGTGCT GACCTCAGTC GTGCCAAACT CAGTGATGCC
GGCCTCAGTG GTGCCAACCT CAGTGTTGCC GGCCTCAGTG GTGCCGACCT CAGATATGCC
GACCTCAGTG GTGCCGACCT CAGATATGCC GACCTCAGTG GTGCTGACCT CAGTGATGCC
AACCTTAGTA ATGTCAGATG GAATAGTCAA ACCAAGTGGT CAAATACTAT TGGTTTACAT
GAAGCAATAG AAGTTCCAGA AGACTTACAA CAAGATCCAG AATTTGCCGC AGCAGTTGCT
GAGTCGGAAG CAGCAAGTCA AGAACAACAG TACATTTTTT GA
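
The gene sequence is displayed in space-separated blocks of 10 bases, so the whitespace has to be removed before any analysis. The sketch below shows one way to normalize such a listing and compute a GC fraction, on the assumption that the reported "GC content" is the share of G and C bases in the gene sequence. The helper names are hypothetical.

```python
def clean_sequence(raw: str) -> str:
    """Strip all whitespace from a blocked sequence listing and uppercase it."""
    return "".join(raw.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases in seq that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in seq if base in "GC")
    return gc_count / len(seq)


# First line of the listing above, used here only as sample input.
first_line = "ATGGCTGTTA AACTGTTGGA CGATTTTTCG AAATCTCTCG GGTTGGAAAC TGTTAACGGT"
seq = clean_sequence(first_line)
print(len(seq), round(gc_fraction(seq), 2))
```

Passing the full listing through `clean_sequence` and `gc_fraction` would give a value to compare against the reported figure.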
 
Protein sequence
MAVKLLDDFS KSLGLETVNG GANLVKALFK LAETLEKYKD TQELKSAIEK INIESLLDAL 
NSPLGSVVRD SVPFLPIATG IISYIVKQSR QEPTLEDEVQ LLVQVAFLES FRQFFVEHPE
IKEQFTDTEA SEDVKKQIKK LGKDGFNRQD AKDTLICFYD SPLREKFDNV VLARLKESGL
DEKTAKIVTE RISRSTHRYM KEAVSEVKDD AKKLAGIYGY GWQQDLEVYG SVDGYLKNKV
ARLPEEKVFD ESFSFQDIYV PLEVESVSDG EVDKNAESQN IEKWAKTILL PQHFDENFDE
NSRKNNEEKD KKDKQVLFIQ GGPGRGKSVF CRMFADWVRR ELHPIYTPIL IRLRDVKNFA
ANIDQTLADA VNYDFVKIDG GWLTDRNTRF LFLLDGFDEL LLERGASSEL KVFLDQVAQF
QKQAAENKER GHRVLITGRP LALYGIERLM PPNLERVSIL PMDDDIQGQW FEKWRTVVGD
EETEKFGEFL RSEQCPEQVN ELAREPLLLY LLASMHRGGK LKVEMFANTD VGETKVLIYE
QALEWVLEKQ RVEEGKNINL EITKLEPEDL EVLLTEAGLC VVQSGGEYAA IEMIEERLVK
KGDKELKRLI ENAREDKRED GLKNALAAFY LKSATGAENS VEFFHKSFGE FLCAKRMAES
LEYFTQKTKK GRKFSYSVSD EELVWQVYDL FGYGGLTVEV VEYLMALLVK SEAKLVVLFE
RLHEFYLDWC DGKFIEATEE TLPQKKAREL QQWRIKSGQR QVDIYTGLNV IILLFELDRY
GQSQEELREQ LRFYPCGQHG SENFEALKLL TIIGYSQCLG VGAFWETVGQ FLSGADLRYA
DLSGAYLIVA NLRYADLSGA YLISADLSGA YLIGANLIGA DLSRADLRYA DLSGANLSDA
KLSGANLSDA KLSGAGLSGA DLRYADLSGA DLSRAKLSDA GLSGANLSVA GLSGADLRYA
DLSGADLRYA DLSGADLSDA NLSNVRWNSQ TKWSNTIGLH EAIEVPEDLQ QDPEFAAAVA
ESEAASQEQQ YIF
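
The protein sequence corresponds to the gene sequence translated with translation table 11. That table assigns the same amino acid to each codon as the standard code and differs from it in which codons may act as start codons. The sketch below is a minimal translator under that reading. It does not handle alternative start codons, which is sufficient here because the gene begins with ATG. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# One amino acid per codon, in the order TTT, TTC, TTA, TTG, TCT, ... GGG.
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(raw: str) -> str:
    """Translate a DNA listing codon by codon, stopping at the first stop codon."""
    seq = "".join(raw.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGGCTGTTA AACTGTTGGA CGATTTTTCG"))  # MAVKLLDDFS
```

The output matches the first ten residues of the protein listing, and the final four codons of the gene (TAC ATT TTT TGA) translate to YIF followed by a stop.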