Gene Tery_4043 details


Gene Information

Locus tag: Tery_4043
Symbol:
ID: 4242071
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Trichodesmium erythraeum IMS101
Kingdom: Bacteria
Replicon accession: NC_008312
Strand:
Start bp: 6247182
End bp: 6248372
Gene Length: 1191 bp
Protein Length: 396 aa
Translation table: 11
GC content: 37%
IMG OID: 638108949
Product: glycosyl transferase, group 1
Protein accession: YP_723530
Protein GI: 113477469
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
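
The coordinate and length fields above can be cross-checked with simple arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the protein length excludes the stop codon; both assumptions are consistent with the listed values but are not stated in the record. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields.
start_bp = 6247182
end_bp = 6248372

# Assumption: coordinates are 1-based and inclusive, so add 1.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not counted
# in the protein length.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)
```

Under these assumptions the computed values agree with the listed Gene Length (1191 bp) and Protein Length (396 aa).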


Plasmid Coverage information

Num covering plasmid clones: 10
Plasmid unclonability p-value: 0.121484
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.172478
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAATTC TAGTATTAAG TTGGGAGTTC CCACCGAGAA TTATAGGAGG AATAGCACGT 
CATGTTGCAG AACTATACCC AGAATTGGTG AAATTAGGAC ATGAAGTTCA TTTAGTTACA
GTCAAGTCTG GTGAAGCACC AATGTATGAA ATTGTAGAGG GAATAGAAGT CTACAGAGTG
CCAGTTGGAC CAAGCCACAA CTTTTTTCAT TGGATAGGAA ATATGAATGA AGCTATGGGT
CGTTATGGAG GAAAACTAAT TAAAGAAGAG AAAAATTTTG ATATAATTCA TGCCCATGAT
TGGCTAGTTG CAGATGCGAC TATCGCTCTT AAACATATCT TTAAACTACC ACTAATAGCT
ACTATTCATG CCACAGAAAA TGGTCGCCAT AATGGTATTC ATAATCGTAG CCAACAGTAT
ATTCATGAAA AGGAAAAAGA GTTAATTTAT AATGCTTGGA GAGTGATTGT TTGCTCAAAC
TATATGCGAG GAGATGTAAC AAGAACTTTA GATAGTCCTT GGGACAAAAT AGACGTAATT
TATAATGGAA TTTGTCCTGA AAAAAAACCT ACTCTAAATC AGTTTGATTA TCTACATTTC
CGTCGGCATT TTGCAACAGA TGAAGAAAAA ATTGTTTACT ACTTAGGTAG AATGACTCCA
GAAAAAGGTT TGTCAGTGTT AATTCATGCA GCACCTAGAG TAATTGAAGA AATGGGAGAT
AGGATAAAAT TTATTATGAT TGGTGGTGGC AAAACTGACT ATTGGAAACA GGAAATCTGG
AATTTAGGAA TTTCGGAAAG ATTCTATTTC ACAGGGTTTA TGTCTGAGGA AAAATTAGAT
AAATTCCAGG CGATCGCAGA TTGTGCAGTA TTTCCTAGCT TATACGAACC GTTTGGTATT
GTTGCCCTAG AAAGTTTTGC AGCAAGGGTG CCAGTGGTGG TTTCAGATAC CGGTGGTTTG
CCAGAAGTGG TAGAACATGG TAAAACGGGT ATTGTTACTA AAGTTGGTAA TCCTACTTCT
CTAGCATTGG GTATTCTAGA AGTTTTGAAA GGCCGTAGCT TTGTCAAAGA GTTGGTGAAT
AATGCTTATC AGGAATTAGA GAATAAATTT TGCTGGGGTA AAATAGCAAA ACAAACTGAT
AGAGTGTATC ACAGAGTACT AGCAGAAAGG ACGCAAGTGA CTTGGAAATA G
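
The GC content field can in principle be recomputed from the gene sequence. The following is a minimal sketch, assuming the sequence is pasted as a string with the space-separated blocks of ten bases shown above; `gc_percent` is a hypothetical helper name, not part of any tool referenced by this record.

```python
def gc_percent(sequence: str) -> float:
    """Return the percentage of G and C bases in a DNA sequence.

    Whitespace (the 10-base block separators and line breaks used in
    the listing) is removed before counting.
    """
    dna = "".join(sequence.split()).upper()
    if not dna:
        raise ValueError("empty sequence")
    gc_count = sum(1 for base in dna if base in "GC")
    return 100.0 * gc_count / len(dna)


# Example on the first block of the gene sequence only.
print(gc_percent("ATGAAAATTC"))
```

Applied to the full 1191 bp sequence, the result would be expected to round to the GC content listed in the record.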
 
Protein sequence
MKILVLSWEF PPRIIGGIAR HVAELYPELV KLGHEVHLVT VKSGEAPMYE IVEGIEVYRV 
PVGPSHNFFH WIGNMNEAMG RYGGKLIKEE KNFDIIHAHD WLVADATIAL KHIFKLPLIA
TIHATENGRH NGIHNRSQQY IHEKEKELIY NAWRVIVCSN YMRGDVTRTL DSPWDKIDVI
YNGICPEKKP TLNQFDYLHF RRHFATDEEK IVYYLGRMTP EKGLSVLIHA APRVIEEMGD
RIKFIMIGGG KTDYWKQEIW NLGISERFYF TGFMSEEKLD KFQAIADCAV FPSLYEPFGI
VALESFAARV PVVVSDTGGL PEVVEHGKTG IVTKVGNPTS LALGILEVLK GRSFVKELVN
NAYQELENKF CWGKIAKQTD RVYHRVLAER TQVTWK
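
The relationship between the gene and protein sequences can be illustrated by translating codons directly. This is a sketch under stated assumptions: it uses the amino acid assignments of the standard genetic code, which translation table 11 shares (table 11 differs in its set of permitted start codons, which is not modeled here), and it assumes the gene sequence is given in the coding orientation starting in frame. The names `CODON_TABLE` and `translate` are illustrative.

```python
from itertools import product

# Codon table built in the conventional TCAG order.
BASES = "TCAG"
AMINO = ("FFLLSSSSYY**CC*W"
         "LLLLPPPPHHQQRRRR"
         "IIIMTTTTNNKKSSRR"
         "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}


def translate(sequence: str) -> str:
    """Translate DNA in frame 1, stopping at the first stop codon."""
    dna = "".join(sequence.split()).upper()
    protein = []
    for i in range(0, len(dna) - 2, 3):
        amino_acid = CODON_TABLE[dna[i:i + 3]]
        if amino_acid == "*":  # stop codon ends translation
            break
        protein.append(amino_acid)
    return "".join(protein)


# First 30 bases of the gene sequence.
print(translate("ATGAAAATTC TAGTATTAAG TTGGGAGTTC"))  # MKILVLSWEF
```

The first ten residues produced this way match the start of the listed protein sequence, and translating the final line of the gene sequence yields the listed C-terminal residues followed by the TAG stop codon.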