Gene Tery_0190 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_0190 
Symbol 
ID4242196 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp285862 
End bp287871 
Gene Length2010 bp 
Protein Length669 aa 
Translation table11 
GC content40% 
IMG OID638105537 
Producthypothetical protein 
Protein accessionYP_720154 
Protein GI113474093 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.474919 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.481243 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCCAATAA AAACCAACCC AATAGCTAGC CTAATTCTCA GCTTATTTTC CCTGGCTCCA 
GCTATTACAC CAACTTTAGC CGCGCTACCC TCTACCCCAG ATCAAGAAGA AACCTGCGAA
ATTTTAGTTA TCGGTGGTGG GTTAGCAGGA GCTGCAACCG CCTATGAAGC ATTATTAGCA
GGGCGCACCG TTTGTATGAC TGAAATCACT GACTGGGTAG GCGGTCAAAT TTCCTCTCAA
GGCACCTCTG CTCTTGACGA ACGCGGAACT CAACGCAGTC TTCTATTTTA TCCCCGTGGT
TATCTAGAGT TACGCGAACG CATTAAAGAA AAATATGGTA GATTAAACCC TGGCAACTGT
TGGGTAAGTG TTTCCTGTTT TATGCCTTAC GATGGTCACG CTATCCTATT CCAGATGTTA
GAAAATGCTG CCAAAAAAGG TGGCGGTACA TTAAAATGGT TTCCCAATAC TGTTATCAAA
CAATTAGACA TTTCTCAGAA CCAAATTAAT AAGGCGATCG CTATCCAACA TCAACCAGCA
GATGATACCC CACCCATAAA CATTGAACCT CTATCTCAAA TTATCGAAGA TGCCTACCTT
TACGAAAATT CAGCAAGATT TGATAAAACT ATAATTCAGT TTGTACCTCA ACCATCTTCA
GAAAAAACAG GTGGTGCAGA CTGGTACGTT GTTGAAGCAA CTGAAACCGG AGAAGTTATT
GCTCTGGCAG ATGTTCCCTA CCGTCTGGGT ATCGATCCCC AAACCTACGT TAACCCCTCA
TCTCCTATTG ATACCACAGA CCCCTACTGC ACTCAAGGTT TCACTTACAC TTTTGCGATG
GAGGCGACTA AAACACCTCA AGCCCATGTC AAACCGCCAT TTTATTCTCA GTATTCCCCC
TACTATAGTT ATGAATTATC TAGATTAGCT AATTTCAACC TAGTTTTCAG CTATAGACAA
ATTTGGTCGA CGAAACCAGA CGCACCTCAA CCAGGTAACT ACAGAAAACG AACTATTTAT
CCCGGTGATA TTTCCATGCA AAATTGGACT TGGGGTAATG ACTACCGCCC TGGAAACAAG
GATGATAATT TAATTTACAG TCGAGGGCAA CTACAAGCTA CAGGTCAACT ACAACCGGGA
GGCTGGATGG GCGGCTTGCG TACTGAAGCC CTCCGTCGAG GTGAGGAAAA TGCTCTTGGT
TATTTTTATT GGTTAGTTGA AGGTACTACT GACTCCCAGT TAGGAGAAGG AGTGAAGCAA
AAATATCCAA ATTATCGTTT ATTGAGTGGG TTTGATACAC CGATGGGAAC AGCCCATGGG
TTATCAAAAT ATCCTTATAT TCGAGAAGGA CGACGCATTA TTGGTCGTGT TGGAAAAACT
CATCCAAGAG GTTTTACAGT GGTGGAGGTT GATATTGCTA AAAAGAATTT TCGGGATGAT
TTTTATCAAA AAAATTTAAG GTCAGATGAG TTTAATTATC TTTGGGGAAT AGTTGGTGGT
TTTGTACCTA AAAATTCAAA AATTAATTCA GTAGAAAAAA TCCCTCAACG AGCAAGGGCA
ACTGTTTTTC CTGATAGTGT AGGAATTGGT CATTATGCCA TAGATTTTCA TCCTTGCATG
ACAAAAAGCC CGCCAGAAGT ACCAAATAAT TCTGAACGAA AAGACATTAG GAAAGGTCAG
GGTGCTACCT ATCCTTTTCA AATACCTTTA CGAGCAATGA TCCCGCAAAG AATTAATAAT
TTATTAGTAG CTGGAAAGAG TATTGCAACA AGTTATATTG CTGCTGCTGC CTATCGAGTT
CATTCTTTTG AATGGTCTGT TGGAGCTGCT GCTGGTACAA CAATTGATTT TGCTTTAGAA
AGAGGTATTT TTCCTTATGA ATTAATTGAT GATATGCCTT CTAAAGAATG GGAATTAGAA
ATATTACAAG GTGATTTAAA TAAAAATGGG AATCACACAG CTTTTCCTGA AACTTCTATT
TTTAATAATT CTTGGAATGA ATGGAAATAG
 
Protein sequence
MPIKTNPIAS LILSLFSLAP AITPTLAALP STPDQEETCE ILVIGGGLAG AATAYEALLA 
GRTVCMTEIT DWVGGQISSQ GTSALDERGT QRSLLFYPRG YLELRERIKE KYGRLNPGNC
WVSVSCFMPY DGHAILFQML ENAAKKGGGT LKWFPNTVIK QLDISQNQIN KAIAIQHQPA
DDTPPINIEP LSQIIEDAYL YENSARFDKT IIQFVPQPSS EKTGGADWYV VEATETGEVI
ALADVPYRLG IDPQTYVNPS SPIDTTDPYC TQGFTYTFAM EATKTPQAHV KPPFYSQYSP
YYSYELSRLA NFNLVFSYRQ IWSTKPDAPQ PGNYRKRTIY PGDISMQNWT WGNDYRPGNK
DDNLIYSRGQ LQATGQLQPG GWMGGLRTEA LRRGEENALG YFYWLVEGTT DSQLGEGVKQ
KYPNYRLLSG FDTPMGTAHG LSKYPYIREG RRIIGRVGKT HPRGFTVVEV DIAKKNFRDD
FYQKNLRSDE FNYLWGIVGG FVPKNSKINS VEKIPQRARA TVFPDSVGIG HYAIDFHPCM
TKSPPEVPNN SERKDIRKGQ GATYPFQIPL RAMIPQRINN LLVAGKSIAT SYIAAAAYRV
HSFEWSVGAA AGTTIDFALE RGIFPYELID DMPSKEWELE ILQGDLNKNG NHTAFPETSI
FNNSWNEWK