Gene Tery_3140 details

Gene Information

Locus tag: Tery_3140
Symbol:
ID: 4244270
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 4795204
End bp: 4796496
Gene Length: 1293 bp
Protein Length: 430 aa
Translation table: 11
GC content: 29%
IMG OID: 638108150
Product: sulfotransferase
Protein accession: YP_722743
Protein GI: 113476682
COG category:
COG ID:
TIGRFAM ID:
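
The start, end, and length fields in this record can be cross-checked with simple arithmetic. The sketch below assumes 1-based inclusive coordinates and a CDS that ends in a single stop codon not counted in the protein length. The function names are illustrative and do not belong to any database API.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming one terminal stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


length_bp = gene_length(4795204, 4796496)
length_aa = protein_length(length_bp)
print(length_bp, length_aa)  # 1293 430
```

Both values agree with the Gene Length and Protein Length fields listed for Tery_3140.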


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value: 0.615346
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.169437
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAGTGATT TTTCCCAAAA GAATTTAATA TTTCTTATTT CACAACCCAG AGCAGGATCA 
ACTCTTACTC AACGTATTTT GGGTAGTCAT CAAGATATTC ATACTATATC TGAACCTTGG
ATTATGTTAC ATCCTTTCTA TGCACTGCGT GATAAGGGGT GTCAAATGGA GTATAGTGCA
GTTAATAGTA AAAAAGGACT TAATAACTTT CTATGGTTAC ATCCTCAAGG GGAAGAAGCA
TATTTTCAGT CAGTAAATAA AATGTGTCTT AATTTATATG AGGGAGTTAT CAAGGAATCT
GGCAAAAAAT ATTTCTTAGA TAAAACGCCA AGATACTATT ATATTTTGCC TGAGTTGTAC
AGAACTTTTC CTTCTGCTAA ATATATTTTT TTATTAAGAA ATCCCTTAGC TGTTTTATGT
TCTATTTTTA ATACTTTTAT TCAAGAACAT TGGTGGAGAA TCCAATATTA TCAAGGTGAT
CTTTTAAAAG CACCTATTTT AATTGCTCAA GGAATGGTTG AATTACAAAA CAAGAGTATT
GTGCTCAGCT ATGAACATTT ACTCGTTAAT CCTAATCAGG AAATCAAAAA AGTTTGTAAA
TTTCTTAATA TTCCTTTTGA TGAGAAAATA TTAAATTATG GTGAGAGTTC CTCACAGAAA
TGGGAATTTG GAGATCAATC TCAAATTTAT CAAGAAAAAA CTCCTAATTC TCAACATAGA
GATCGTTGGA AAAAAGATTT AGATAACCCG ATTATTTGGC AATGTGTATC TAATTATTTA
GAGTTTTTAG GAAATGATCT TCTTAATAGT TTAGGTTATT CTTATGAAGA AGTTAAAAAT
ATTCTTTCTG ATTATAGTTA TCAAACTAAC ATAGTTTTAC CTCCTGCTCT GAAGGATTTT
TTTACTAGTG CAAATCTTTT TAAAAATAAA GCTTTACAAC CTTATTTAGA AGCAGTGGAA
TTAAACCCCC AAATATTCCA CCCTTATCTA GATCTTGGCA AAGCATTATT AGAAAAAAAA
GATTTTAAAA AAGCTCTTAA TTATCTACAA ATAGCTTTAA AATTAGCTCC TTATATACCA
GAAATTCATT TTTTAATAGG AGAAAATCTT TTAGGTTTAG GTGAATTAGA TCAAGCTATT
ATTTATTATC AAAAAACTAT TGATTTAGAC TTTAGATTTG TCAAAAATTA TGATAAGATA
GAATCTACAA TAATGGCTCT TAAAGAAGTC GCTCAAGTTA ATCCAAATCA TCAGGAGATC
GCTAATTTAA TCAAAACAAT AACAAATATT TGA
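
The gene sequence is printed in blocks of ten bases, so the spacing has to be stripped before any computation. As a minimal sketch, the helpers below (hypothetical names) clean a block-formatted listing and compute its GC fraction. Only the first line of the sequence is used here, so the result describes those 60 bases and not the whole gene.

```python
def clean_sequence(text: str) -> str:
    """Drop spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C."""
    if not seq:
        raise ValueError("empty sequence")
    return (seq.count("G") + seq.count("C")) / len(seq)


# First line of the gene sequence only; paste the full listing
# to compute the value for the entire gene.
first_line = "ATGAGTGATT TTTCCCAAAA GAATTTAATA TTTCTTATTT CACAACCCAG AGCAGGATCA"
seq = clean_sequence(first_line)
print(len(seq), f"{gc_fraction(seq):.1%}")
```

Running the same functions on the complete listing would give a figure to compare against the GC content field of the record.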
 
Protein sequence
MSDFSQKNLI FLISQPRAGS TLTQRILGSH QDIHTISEPW IMLHPFYALR DKGCQMEYSA 
VNSKKGLNNF LWLHPQGEEA YFQSVNKMCL NLYEGVIKES GKKYFLDKTP RYYYILPELY
RTFPSAKYIF LLRNPLAVLC SIFNTFIQEH WWRIQYYQGD LLKAPILIAQ GMVELQNKSI
VLSYEHLLVN PNQEIKKVCK FLNIPFDEKI LNYGESSSQK WEFGDQSQIY QEKTPNSQHR
DRWKKDLDNP IIWQCVSNYL EFLGNDLLNS LGYSYEEVKN ILSDYSYQTN IVLPPALKDF
FTSANLFKNK ALQPYLEAVE LNPQIFHPYL DLGKALLEKK DFKKALNYLQ IALKLAPYIP
EIHFLIGENL LGLGELDQAI IYYQKTIDLD FRFVKNYDKI ESTIMALKEV AQVNPNHQEI
ANLIKTITNI
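
The protein sequence can be reproduced from the gene sequence by codon-wise translation. The sketch below builds a codon table from the standard codon assignments, which translation table 11 is assumed to share for elongation (the tables differ mainly in permitted start codons, which does not matter here because the gene starts with ATG). The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

BASES = "TCAG"
# Amino acids for the 64 codons in TCAG order (standard assignments).
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(dna: str) -> str:
    """Translate a DNA sequence codon by codon, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    protein = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


first_30_nt = "ATGAGTGATT TTTCCCAAAA GAATTTAATA"
print(translate(first_30_nt))  # MSDFSQKNLI
```

The output matches the first ten residues of the protein sequence listed above. Passing the full gene sequence should, under the same assumptions, yield the full 430-residue protein.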