Gene Tery_1787 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1787 
Symbol 
ID4243769 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2728673 
End bp2730601 
Gene Length1929 bp 
Protein Length642 aa 
Translation table11 
GC content33% 
IMG OID638106911 
Producthypothetical protein 
Protein accessionYP_721519 
Protein GI113475458 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.799835 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCAAACTA ACAATCATTT ATCCATAAAT CCTTATGTTT TTGGCAAACC AATTTATGAA 
TACAACAACT TATTTGGTAG AAAAAATGAT GTTGATAAAA TTAAAGATCA CATCATTAAC
AAAGATATAA AAATAACTTT ATTGCATGTC CAAAGACGTA TTGGTAAAAC TTCATTGATA
ACTTGTTTGC CTCAGTCTTT CACTGAGGAG CAGAATGGTG TTAAGTTTGT TACTTTTTCA
TTTCAAGGTT ATAAAGATAA GCCAATCCCT GAAATACTAA ATTATCTTGC TGATGAGATC
GCTGGTACTA TTCAACTTCC TCAAAAAGTA AGAGATCAGG CTGATACTAC GCACAACTTT
TTTGAACTTT TTTTGCCGAA AGTTATCGAT CAATATTTGT CAGGTCAAAA GTTGGTTCTT
CTCCTTGATG AATTTGATGT TTTAGAAGAA AAAGATAAGA AAGGAAAAGT ATTATTTGAT
TACTTAAAAA AAGCTGTTAA GGAACAAAAA AAACTATTTG CTATTCTGGT TTTTGGTAGA
CCCTTAAAGG ATATGAAGTA TCTAGAAACA TTTTTACAAG AAGAAGGTCA AGAGACTATA
GAAGTTGGTT TACTGGATTA TGAAGGTACA CAAGATTTGA TTGTTGAGCC ACTTAAGAGA
ATACAGAGCG TATTTAGATA TGAAAAAAGT GCAATAGATA GAATTTGGGA ACTATCTGCT
GGCCATCCTT CTTTGACACA ACTGCTGTGC TCAAATGTTT TTGTTCATTG TAGGAATAAG
CAACAGAGTG TAGTTCGGAA AGACGATGTA GACTCAATTT TAAGTCAAGC AATGGAAGAA
GGCCAGGCAA TATTACAAGG CTTTCTAGAG CCTTTAAGCG ATATTGAAAA GTTATTTTTT
TTTGCAGTAG CAGAAGCTCA AGAACAAGGC ACAGACCCAT TAAAGATTCT GAAAATAATA
CAAAAAACTA CAATAACACC GGCAGATTTT AGAAGAGCAC GAGAGCGTTT AATAGAGTTA
GGCTTTGTAG AAAAAAATGG CAAAGGTCTT AAAATAAAAG TTGAATTAGT CCGGCTCTGG
CTAATAGAAA AAAACCCCTT ACCCAACAAT AAACAGAGGA AACCGAAGGA AGGAAGAAAA
AAGATCAAAC GTCATATAAC CTCAAGTCAA CCCAACAGAC CTAACCCAGT TGCACAATTC
ATTGCTTTTA TTGCGTTGAT AAGCGTTATA GTTTTTATTG GACAGAAGTT ACTCTCTAGG
ATTGATAGCT CCAAAAACTA TGAACGCTTT CAGTCTGACT GTTACAGACT ATCAGAGGAA
ATAAGCAACG CTTTAGAAGA GAAAAAAGAT ACAACGCAGT TGCAAGTCAT CAAAAAAGTT
AGAACTGAAT GGTCGAGAGA AAAAAAAGGC TTATTAGACA AACAATGCCC ATATTCTTAT
GAACTAGATG CAAAATATAA TGCATTACTA CAGTACTATG GACAAAGTAA AGTAGATACT
GGAAACTTTG ATGAAGGTAT AGAAGCATTT TGTGAGATTA CCAGTGAATA CAAGAATTTT
TCTGACATTA AAAAAATCTT TGAAAGATGG GTACTAATAG ACAAAAGATT ATCTAATGAA
AGTACAAAAA GGGTGCTGAA GCAAATAATT AAGCAAAATC AATCAGGAAA TGATTGCCTT
GTTTATTCAT TTAAAGACGA TAGAAATAAA AATGATCTGT ATGACCTGAA AGCTCAAGTT
CATGCTGATG ATTATGAGTA TGGCGAAGCG GTCGAGTCAT ATTGCAAAAT TACAGAAAAC
TATTATAAGT TTGAGACTGT TGTTAAACAG CTAAAAAAAT TGAAACGAGA AAATGTAGAG
AAAGTAGAGG AAAAACTCAA AGAATTAAAC GATCCGTGTC CAGCATTTCC TCCCTCACCA
GACAATTAA
 
Protein sequence
MQTNNHLSIN PYVFGKPIYE YNNLFGRKND VDKIKDHIIN KDIKITLLHV QRRIGKTSLI 
TCLPQSFTEE QNGVKFVTFS FQGYKDKPIP EILNYLADEI AGTIQLPQKV RDQADTTHNF
FELFLPKVID QYLSGQKLVL LLDEFDVLEE KDKKGKVLFD YLKKAVKEQK KLFAILVFGR
PLKDMKYLET FLQEEGQETI EVGLLDYEGT QDLIVEPLKR IQSVFRYEKS AIDRIWELSA
GHPSLTQLLC SNVFVHCRNK QQSVVRKDDV DSILSQAMEE GQAILQGFLE PLSDIEKLFF
FAVAEAQEQG TDPLKILKII QKTTITPADF RRARERLIEL GFVEKNGKGL KIKVELVRLW
LIEKNPLPNN KQRKPKEGRK KIKRHITSSQ PNRPNPVAQF IAFIALISVI VFIGQKLLSR
IDSSKNYERF QSDCYRLSEE ISNALEEKKD TTQLQVIKKV RTEWSREKKG LLDKQCPYSY
ELDAKYNALL QYYGQSKVDT GNFDEGIEAF CEITSEYKNF SDIKKIFERW VLIDKRLSNE
STKRVLKQII KQNQSGNDCL VYSFKDDRNK NDLYDLKAQV HADDYEYGEA VESYCKITEN
YYKFETVVKQ LKKLKRENVE KVEEKLKELN DPCPAFPPSP DN