Gene Tery_4376 details


Gene Information

Locus tag: Tery_4376
Symbol:
ID: 4246029
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6743830
End bp: 6744918
Gene Length: 1089 bp
Protein Length: 362 aa
Translation table: 11
GC content: 37%
IMG OID: 638109263
Product: aminodeoxychorismate lyase
Protein accession: YP_723840
Protein GI: 113477779
COG category: [R] General function prediction only
COG ID: [COG1559] Predicted periplasmic solute-binding protein
TIGRFAM ID: [TIGR00247] conserved hypothetical protein, YceG family
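
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The following is a minimal sketch, assuming the start and end positions are 1-based and inclusive and that the final codon is a stop codon that is not counted in the protein length; the variable names are illustrative and not part of the record.

```python
# Values copied from the Gene Information fields.
start_bp = 6743830
end_bp = 6744918

# Assumption: coordinates are 1-based and inclusive, so both ends count.
gene_length = end_bp - start_bp + 1

# Assumption: the coding sequence ends with one stop codon that is not translated.
codons = gene_length // 3
protein_length = codons - 1

print(gene_length)      # 1089, matching "Gene Length"
print(gene_length % 3)  # 0, the length is a whole number of codons
print(protein_length)   # 362, matching "Protein Length"
```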


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.0936456
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.529135
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAAAA GTATTAAACT TATTTCCAAT CGTGCCTTAT TTTACTTGGC CATTCTACCT 
GTAAGCTGTG GTGTTTTTGC TTGGCAGGGT TGGAGTTGGT GGAGTTGGGT AAGTAGACCT
GTAGTTTCAC CAACATCCTC TACTCAGTCT TCACAAGCTA ATGCTATAAG AATTAAAATT
CCTGTGGGAA CTTATGGTCA ACAAATAGGT GAGTATTTAG AAGATGCTGG TATTATTCGC
TCTGCAACAG CTTGGAATTT ATGGGTAAAA TGGTTGAGTC TACAAAATCC CAATCTTGAG
TTTAAAGCTG GAACTTATAA TTTATTGCCT ACAGAACCAC TAAGCGCGAT CGCAGATAAA
ATTCTACAGG GAGATGTAGT TAAACTCAGC TATGTCATTC GTGAAGGATG GTCAATTCAA
CAAATGGCTG CATATTTGGA TGATGAAGGT TTTTTTCCAG CTGCTGATTT TATTGCAGCA
ACAAAAAATA TTCCCTATGA TAAGTTTCCA TGGTTACCAA CTAATATACC TCATCTAGAG
GGTTATTTAT TCCCAGATAC TTATAAAATA GTAGCGGATA ATATTACTCC AGAAGCTATT
ATCAATCAAA TGATAGGACA GTTTGAACAA GTAGCTTTGC CAGTTTATCA GAAAAACCAG
AACAATACAA CAAAATTGAG TCTTCATGAA TGGGTAAGTT TAGCAAGTAT TGTAGAAAAG
GAAGCTGTAG TTGCACAAGA ACGTGGTTTA ATTTCGGGGG TGTTTAATAA CCGTTTGGAA
CAGGGTATGA GGTTAGCAGC AGACCCAACA GTAGAATATG GTCTTGGTAT TCGTCAAACG
AAAGATAAGC CTCTTACTTA TAGTCAGATT GAAACTCCTT CACCTTATAA TACTTATATG
AATACTGGGT TACCACCAAC TCCTATTTCT AGTCCAGGTA AGGCCAGTTT GGAAGCAACT
CTTAATCCAG AAGATACAGA ATATTTGTAT TTTATGGCTC GCTATGATGG TACCCATATT
TTTAGTCGTA CTGCTAGAGA ACATGAGGCT GCTATTGCAG AGGTAGAGAG ATTGTTATCA
TCTCAGTAA
 
Protein sequence
MKKSIKLISN RALFYLAILP VSCGVFAWQG WSWWSWVSRP VVSPTSSTQS SQANAIRIKI 
PVGTYGQQIG EYLEDAGIIR SATAWNLWVK WLSLQNPNLE FKAGTYNLLP TEPLSAIADK
ILQGDVVKLS YVIREGWSIQ QMAAYLDDEG FFPAADFIAA TKNIPYDKFP WLPTNIPHLE
GYLFPDTYKI VADNITPEAI INQMIGQFEQ VALPVYQKNQ NNTTKLSLHE WVSLASIVEK
EAVVAQERGL ISGVFNNRLE QGMRLAADPT VEYGLGIRQT KDKPLTYSQI ETPSPYNTYM
NTGLPPTPIS SPGKASLEAT LNPEDTEYLY FMARYDGTHI FSRTAREHEA AIAEVERLLS
SQ
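
The sequences above are displayed in space-separated blocks of ten letters, so they need to be joined before any analysis. The following is a minimal sketch of that cleanup with two simple checks; `join_blocks` and `gc_fraction` are hypothetical helper names, and only the first and last lines of the gene sequence are used to keep the example short.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to display a sequence in blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Return the fraction of G and C bases in a DNA string."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First and last lines of the gene sequence, copied from the record.
first_line = "ATGAAAAAAA GTATTAAACT TATTTCCAAT CGTGCCTTAT TTTACTTGGC CATTCTACCT"
last_line = "TCTCAGTAA"

head = join_blocks(first_line)
tail = join_blocks(last_line)

# The three stop codons shared by the standard code and translation table 11.
STOP_CODONS = {"TAA", "TAG", "TGA"}

print(len(head))                # 60 bases per full display line
print(head.startswith("ATG"))   # True: the gene begins with an ATG start codon
print(tail[-3:] in STOP_CODONS) # True: the gene ends with TAA
print(f"GC fraction of the first line: {gc_fraction(head):.2f}")
```

Applying `gc_fraction` to the full joined gene sequence, rather than to one line, is the way to compare against the GC content field in the record.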