Gene Tery_1874 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_1874 
Symbol 
ID4245495 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp2867932 
End bp2869743 
Gene Length1812 bp 
Protein Length603 aa 
Translation table11 
GC content35% 
IMG OID638106995 
Productcell surface protein 
Protein accessionYP_721603 
Protein GI113475542 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.665345 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones14 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGATGAAC AATTCTTAGG AGATAACTCC TCAATAATGT CTAACTTTTC AGAGCCTTCG 
GTTGAAATTT TTGACTTGTT AGGAGGTAGT AACTTTGATG AAGGAATAGG AATAACTACT
GACAATGAAG GCAATATTTA TATAACTGGT AGTACCACAT CTACAGATTT TGAAGTGACT
CCAAATGCGG TACAAAATAG CTTTGGTGGA GGTGATGAAT TTCGGGATGG TGATGCTTTT
GTTGCTAAAT ATTCCCCTGA TGGAACTCAA GTTTATGCTA CATATATAGG TGGAAGTGGT
AGAGATTTTG GTACGGATAT TGCTTTAGAT ACTATAACCG GAGATATTTA CATTACGGGT
AGTACTAATT CTGTTGACTT TCCGACAGTA AATGCTTTGC AAAATACTTA TGGGGGTGGT
GATTTTTCTG GGGATGCTTT TGTGGTCAAA CTATCTAATG ATGGAAATAA TATTCTTTAT
TCTACTTATC TAGGTGGGCA AGATAACGAT TATAGTAGTG CTATTGCTGT AGATAATAAT
GGGGATGCTT ATATAACTGG AGAAACTGGT GCCCAACTCC GTTTTCCAAT TCAACCTATT
CCTGGAGTTG GTGATTTTCC AACTACAGAA AATGCTCTAC AAAAGACTCT TGTTAATGAA
CTTAACCGAG ATAGTTTTGT GAGTAAGATA TCTACTGATG GAAATCAATT AATTTACTCA
ACTTTATTAG GTGGAAATGA CACAGAAATA AGTCAGGATA TTACTGTTGA CAATAATGGT
AATGCTTATA TTACTGGTCA AACTCGTTCC TTTGATTTTC CAACTGCAAA TGCAGTACAA
AATACGATAG GAGGAGATGG AGATGTATTT ATTACTCAGC TTAATAGTGA TGGTAGTGAC
CTGATTTTTT CTTCTTACTA TGGTGCTGTA GATGGAGATA TTGGTAATGG TATTGCTGTA
GATGATATGG GTAGTATCTA TATAGCTGGA AGTTCGGGTA GTCAAATTAT GGGCGGTGAT
GCGGTAGTAC CTGCAGTTGG TGAGTTTCCT ATTGTTAATG CTTTGTACAA TACTTTTGGT
GGGGGTGAAA GTGATGGAAT ATTGATTAAG ATCAATACTG AGCGCTTAGT TACTTTTGGT
GATAGTGAAA GTGATGAAAT ATTACTTAGG GAAATTGATT ATCGTTCTGT TGAATATGCT
ACTTACTTAG GTACGGAAAA TTTAGATTTT ATTGAGAGTA TTGATCTTGA TGCTGCAGGA
AATATTTATA TTGTTAACAA CAATAATCTT TTTAATACTG TTGTGAGTAA AATATCTAAT
GATGGTCAAG TTATAGAATA TTCTATTCCC TTTCGGATAA ATGACAATAT TGGGTTATTT
GGTAATGATA TTAAAGTAGA TGAAGTTGGT AATGCTTATT TTGTCGGCTT GACTATTCCT
AATACTGATT TAGATATAGA TAATGAAGAT GCCTTTATTG GTAAAGTTAC TGCCAGTACT
TCCTCTCCTG ATCTTCCGAT CTTTGTTACT CCTCCTAATA TTGGTGAGAG TTTCGACGAA
AATCTTTATT TAATAGAAAA TCCTGGTGTG GCAAATGCTG TGGCAAACGG TTTTTTTGAT
AGTGGGTTTG AGCATTGGTT GGAGTTTGGT TTTTTGGAAG GGCGATCGCC TCAGTTGGCT
TTTGATGAGC AATTTTATTT GGCTACTTAC CCAGGAGTTG CTAGTGCTGT AGCAAATGGT
GTTTTTATTA ATGGTTTAGA ACATTACGTT AAGTTTGGTG AAGCAGAAGG GCGTTTACCT
ATGAGGAGCT AA
 
Protein sequence
MDEQFLGDNS SIMSNFSEPS VEIFDLLGGS NFDEGIGITT DNEGNIYITG STTSTDFEVT 
PNAVQNSFGG GDEFRDGDAF VAKYSPDGTQ VYATYIGGSG RDFGTDIALD TITGDIYITG
STNSVDFPTV NALQNTYGGG DFSGDAFVVK LSNDGNNILY STYLGGQDND YSSAIAVDNN
GDAYITGETG AQLRFPIQPI PGVGDFPTTE NALQKTLVNE LNRDSFVSKI STDGNQLIYS
TLLGGNDTEI SQDITVDNNG NAYITGQTRS FDFPTANAVQ NTIGGDGDVF ITQLNSDGSD
LIFSSYYGAV DGDIGNGIAV DDMGSIYIAG SSGSQIMGGD AVVPAVGEFP IVNALYNTFG
GGESDGILIK INTERLVTFG DSESDEILLR EIDYRSVEYA TYLGTENLDF IESIDLDAAG
NIYIVNNNNL FNTVVSKISN DGQVIEYSIP FRINDNIGLF GNDIKVDEVG NAYFVGLTIP
NTDLDIDNED AFIGKVTAST SSPDLPIFVT PPNIGESFDE NLYLIENPGV ANAVANGFFD
SGFEHWLEFG FLEGRSPQLA FDEQFYLATY PGVASAVANG VFINGLEHYV KFGEAEGRLP
MRS