Gene Tery_4420 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_4420 
Symbol 
ID4246073 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp6811774 
End bp6814512 
Gene Length2739 bp 
Protein Length912 aa 
Translation table11 
GC content43% 
IMG OID638109304 
Producthypothetical protein 
Protein accessionYP_723881 
Protein GI113477820 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.0816667 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGGTGAACG ATGGTAATGT TGATAGTAAC ACAGCCACAA CAACAATAAA TATTACTCCA 
GTCAATGACG CTCCCGTATT AGACTTAGAT GGGAATAATA GTAGTACCGC TACAGGCAGT
GACTACATAA CAACTTTCAC AGAAGGCGGA GGAGCAGTAG CCATAGGAGA CAGCGATGTT
AGTATCACAG ATGCAGATGA TATCAACATA GAGTCAGCCA CAATAACATT AAGCAGTCGA
CCAGACGGAG ATACAGTAGA AAGCTTATTA GTCAACGGTA CACTGCCAAC AGGAATAACA
GCGAGCAGTT ATGACAGTAG TACAGGAGTC ATAACACTGA CAGGTAGCGC TACATTAGCT
GACTATCAGA CAGCCATAGC CCAAATACAA TATAACAACA CCTCGGAAAA CCCAGACACC
AGTGCTCGGA GCGTGACCGT AGTGGTGAAC GATGGTAATT TTGATAGTAA CACAGCCACA
ACAACAATAA ATATTACTCC AGTCAATGAC GCTCCCGTAT TAGACTTAGA TGGGGATAAC
ACAGGGAATG ACTACACAAC AACTTTCACA GAAGGCCTAG GAGCAGTAGC TATAGGAAAC
AACGTTGGTA TAGCAGATGA AGATGATACC AACATAGAGT CAGCCACAAT AACATTAGGC
AGTAGACCAG ATGGAGATAC AGTAGAAAGC TTATTAGTCA ACGGTACACT ACCAACAGGA
ATAACAGCGA GCAGTTATAA CAGTACTACA GGAGTCATAA CACTCACAGG CAGCGCTACA
TTAGCTGAGT ATCAGACAGC CATAGCCCAA ATACAATATA ACAACACCTC GGACAACCCT
AATACCACCG ATAGGACAGT GACCGTAGTG GTGAACGATG GTGATGCTGA TAGTAGTACA
GCCACAACAA CAATAAATAT GACTCCAGTC AATGATGCTC CAGTATTAGA CTTAGATGGG
GATAATAGTA CGACAACAGG CAGTGACTAC ATAACAACTT TCACAGAAGG AACGGCAGTA
AACATAGGAG ACAGCGATGT TAGTATCACA GATGTAGATG ATAGCAACAT ACAGTCAGCC
ACAATAACAT TATCGAACAT ACAAGACGGA GCATCAGAAA GCTTATCCGC CGGTACACTG
CCAACAGGAA TAACAGCGAG CAGTTATGAC AGTAGTACAG GAGTCATAAC ACTGACAGGT
AGCGCTACAT TAACTGACTA TCAGACAGCC ATAGCCCAAA TACAATATAA CAACACCTCG
GAAAACCCAG ACACCAGTGC TCGGAGCGTG ACCGTAGTGG TGAACGATGG TAATGTTGAT
AGTAACACAG CCACAACAAC AATAAATATT ACTCCAGTCA ATGACGCTCC CGTATTAGAC
TTAGATGGGG ATAACACAGG GAATGACTAC ACAACAACTT TCACAGAAGG CCTAGGAGCA
GTAGCTATAG GAAACAACGT TGGTATAGCA GATGAAGATG ATACCAACAT AGAGTCAGCC
ACAATAACAT TAGGCAGTAG ACCAGATGGA GATACAGTAG AAAGCTTATT AGTCAACGGT
ACACTACCAA CAGGAATAAC AGCGAGCAGT TATAACAGTA CTACAGGAGT CATAACACTC
ACAGGCAGCG CTACATTAGC TGAGTATCAG ACAGCCATAG CCCAAATACA ATATAACAAC
ACCTCGCAAA ACCCAGACCC CACGGATCGG ACCGTGACCG TAGTGGTGAA CGATGGTGAT
GCTAATAGTA ACACAGCCAC AACAACAATA AGTCTTGTTC CAGTCAATGA CCCAGTCCAC
TTTGATTTTA ATGCTGATGG AGTGGCAGAC ATTCTCTGGC GTCATAAAAG TCTCCAAAAT
GGACCTAACA GGATCTGGTT GATGAAGAAT GACGGCACAC GGGATAGTAT CGTTAACCCT
GGATCTTTTG GTTCAAATTG GAATGTAGAA AGAGTGGGAG ATTTCAATGC AGATGGAGTG
GCAGACATTC TCTGGCGTCA TCAAAGTCTC TCATCTGGAC CTAACAGGAT CTGGTTGATG
AAGAATGACG GCACACGGGA TAGTATCGTT AACCCTGGAT CTTTTAATTC AAATTGGAAT
GTAGAAGAAG TGGGAGATTT CAATGCAGAC GGAGTGGATG ACATTCTCTG GCGTCATAAA
AGTCTCCAAA ATGGACCTAA CAGGATCTGG TTGATGAAGA ATGACGGCAC ACCCGATAGT
ATCGTTAACC CTGGATTTTT TGGTTCAAGT TGGAATGTAG AAGAAGTGGG AGATTTCAAT
GCAGATGGAG TGGCAGACAT TCTCTGGCGT CATAAAAGTC TCCCACATGG ACCTAACAGG
ATCTGGTTGA TGAAGAATGA CGGCACACCC GATAGTATCG TTAACCCTGG ATTTTTTAAT
TCAAATTGGA ATGTAGAAGA ATTGGGAGAT TTCAATGCAG ATGGAGTGGA TGACATTCTC
TGGCGTCATA AAAGTCTCTC ACATGGACCT AACAGGATCT GGTTGATGAA GAATGACGGC
ACACCCGATA GTATCGTTAA CCCTGGATTT TTTAATTCAA ATTGGAATGT AGAAGGAGTG
AGAGATTTCA ATGCAGATGG AGTGGATGAC ATTCTCTGGC GTCATCAAAG TCTCCCAAAT
GGACCTAACA AGATCTGGTT GATGGAGAAT GACGGCACAC GGGATAGTAT CGTTAACCCT
GGATCTTTTA ATTCAAATTG GGATATAGCT GGAATGTAA
 
Protein sequence
MVNDGNVDSN TATTTINITP VNDAPVLDLD GNNSSTATGS DYITTFTEGG GAVAIGDSDV 
SITDADDINI ESATITLSSR PDGDTVESLL VNGTLPTGIT ASSYDSSTGV ITLTGSATLA
DYQTAIAQIQ YNNTSENPDT SARSVTVVVN DGNFDSNTAT TTINITPVND APVLDLDGDN
TGNDYTTTFT EGLGAVAIGN NVGIADEDDT NIESATITLG SRPDGDTVES LLVNGTLPTG
ITASSYNSTT GVITLTGSAT LAEYQTAIAQ IQYNNTSDNP NTTDRTVTVV VNDGDADSST
ATTTINMTPV NDAPVLDLDG DNSTTTGSDY ITTFTEGTAV NIGDSDVSIT DVDDSNIQSA
TITLSNIQDG ASESLSAGTL PTGITASSYD SSTGVITLTG SATLTDYQTA IAQIQYNNTS
ENPDTSARSV TVVVNDGNVD SNTATTTINI TPVNDAPVLD LDGDNTGNDY TTTFTEGLGA
VAIGNNVGIA DEDDTNIESA TITLGSRPDG DTVESLLVNG TLPTGITASS YNSTTGVITL
TGSATLAEYQ TAIAQIQYNN TSQNPDPTDR TVTVVVNDGD ANSNTATTTI SLVPVNDPVH
FDFNADGVAD ILWRHKSLQN GPNRIWLMKN DGTRDSIVNP GSFGSNWNVE RVGDFNADGV
ADILWRHQSL SSGPNRIWLM KNDGTRDSIV NPGSFNSNWN VEEVGDFNAD GVDDILWRHK
SLQNGPNRIW LMKNDGTPDS IVNPGFFGSS WNVEEVGDFN ADGVADILWR HKSLPHGPNR
IWLMKNDGTP DSIVNPGFFN SNWNVEELGD FNADGVDDIL WRHKSLSHGP NRIWLMKNDG
TPDSIVNPGF FNSNWNVEGV RDFNADGVDD ILWRHQSLPN GPNKIWLMEN DGTRDSIVNP
GSFNSNWDIA GM