Gene Tery_4332 details

Gene Information

Locus tag: Tery_4332
Symbol:
ID: 4245984
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6677849
End bp: 6678901
Gene Length: 1053 bp
Protein Length: 350 aa
Translation table: 11
GC content: 42%
IMG OID: 638109219
Product: hypothetical protein
Protein accession: YP_723797
Protein GI: 113477736
COG category: [E] Amino acid transport and metabolism
COG ID: [COG0547] Anthranilate phosphoribosyltransferase
TIGRFAM ID:
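
The coordinate and length fields above are mutually consistent, which can be checked with a short sketch. It assumes the Start bp and End bp values are 1-based and inclusive, and that the coding sequence ends in a stop codon that is not counted in the protein length (the gene sequence in this record does end in TAA). The function names are illustrative only and are not part of any database API.

```python
# Values copied from the record above.
START_BP = 6677849
END_BP = 6678901


def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature given 1-based, inclusive coordinates (assumed)."""
    return end_bp - start_bp + 1


def protein_length(cds_length: int, has_stop_codon: bool = True) -> int:
    """Number of amino acids encoded by a CDS of the given length in bp."""
    if cds_length % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    codons = cds_length // 3
    # The terminal stop codon does not encode an amino acid.
    return codons - 1 if has_stop_codon else codons


if __name__ == "__main__":
    length = gene_length(START_BP, END_BP)
    print(length)                  # 1053, matching "Gene Length"
    print(protein_length(length))  # 350, matching "Protein Length"
```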


Plasmid Coverage information

Num covering plasmid clones: 20
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 11
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGCGATC GCTTTCGAGA ATTACTCAAA ATCATTGGCA GTGGAACCCA CACAGGTAAA 
AATTTAACTC GCCAAGAAGC AGCAGCAGCA ATGCGCATGA TGTTATTAGG AGAAGCAACA
CCTACACAAA TAGGTGCTTT TCTTATTGCT CACCGTATTA AACGCCCCAC TGGTGAGGAG
TTAGCAGGAA TGTTAGATAC CTATAATGAA TTGGGGCCAA AACTTAAAAG TCAACCATCT
ATGGGTACAG TAACCGTCTT AGGTTGTCCT TATGATGGGC GATCGCGGAC TTCACCTGTC
ACCTTACTCA CAGCTTTAAT TTTGGCAACA GCAGGAGTAT TTGTTGTCAT CCACGGTGGA
AGGCGGATGC CGACAAAAGA AGGCATACCT TTTATTGATA TCTGGCAAGG ACTAGGGGTT
GAGTGGGGAA AATTATCGCT GGTGGAGGTT CAACGAGTAT TTGAGGAAAC TGGTCTAGGG
TTCGTCTATT TACCAAGACA TTTTCCTCAA GCAGATGCTT TAGTAAAACA TCGTCGAGAT
ATTGGTAAAC GACCTCCTAT CGCAATAATG GAATTAATTT GGGTACCCTT GGCGGGAGAA
GTTCATTTAG CTGCAGGGTA TGTTCATCCT CCCACAGAAG GTATGTTTCG TGAAGTATTG
GAATTACATG GTTTGAGGAA TTATACAACG GTGAAGGGGT TAGAGGGAAG TTGTGACTTG
CCCCGCGATC GGACAGCTAT TATTGGGGTA TCGTTGTCAT CTGGGAATGA TGCCACATTT
GAACGTCTAT TGTTACATCC GAGTGACTAT AGTTGTGGAG GGAAGGAAGT TGTATTGGGT
TCAACTGCAG AGTTAGTAGA AGAGATACAA AAAATACTAC AGGGTAAAGC CAGTAAGTTA
ATGTCAGCAG TTATTTGGAA TGGCGCTTTT TATTTGTGGC GTTGTGGAAT TTGCTCTGAT
ATTAATGAAG GTTTGTTGAA AGCGGAAAGT TTATTAAATA GTGGTAAAGT TAGGGATAAG
TTGAGAGAAA TTAAAGCAAA AATTGAGATA TAA
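
The gene sequence is printed in space-separated blocks of 10 bases, so it needs to be cleaned before any computation. The sketch below shows one way to strip the layout and recompute a GC percentage of the kind reported in the GC content field. The helper names are hypothetical, and only the first line of the sequence is pasted in for brevity; the full 1053 bp sequence would be needed to reproduce the reported value.

```python
def clean_sequence(raw: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(raw.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a cleaned nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


if __name__ == "__main__":
    # First line of the gene sequence from the record; paste all lines
    # here to compute the value for the whole gene.
    raw = "ATGAGCGATC GCTTTCGAGA ATTACTCAAA ATCATTGGCA GTGGAACCCA CACAGGTAAA"
    seq = clean_sequence(raw)
    print(len(seq), round(gc_percent(seq), 1))
```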
 
Protein sequence
MSDRFRELLK IIGSGTHTGK NLTRQEAAAA MRMMLLGEAT PTQIGAFLIA HRIKRPTGEE 
LAGMLDTYNE LGPKLKSQPS MGTVTVLGCP YDGRSRTSPV TLLTALILAT AGVFVVIHGG
RRMPTKEGIP FIDIWQGLGV EWGKLSLVEV QRVFEETGLG FVYLPRHFPQ ADALVKHRRD
IGKRPPIAIM ELIWVPLAGE VHLAAGYVHP PTEGMFREVL ELHGLRNYTT VKGLEGSCDL
PRDRTAIIGV SLSSGNDATF ERLLLHPSDY SCGGKEVVLG STAELVEEIQ KILQGKASKL
MSAVIWNGAF YLWRCGICSD INEGLLKAES LLNSGKVRDK LREIKAKIEI