Gene Tery_3809 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3809 
Symbol 
ID4242260 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5856287 
End bp5857933 
Gene Length1647 bp 
Protein Length548 aa 
Translation table11 
GC content28% 
IMG OID638108744 
Productglycosyl transferase family protein 
Protein accessionYP_723327 
Protein GI113477266 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.0772252 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTCAAAA ATATAAACCA AAAACTAATA ATTTTTATTG GGTTTCTTCT ATTATCATTC 
TTTCTCAGAT TTTGGACTTT ATTTGTATCT GTGTTAGACA AAGATGAAAG TATTTATATA
TTAGGAGCAG ATAGTTTATT AAATGGTAAT CTCCCCTATA CAGAAATTTG GGATAATAAA
CCTCCTGGTA TTTTTATATT GTTTTCCCTA GCAATGCTAA TTTTTGATAG GTCTATAGTA
TCAATCAGAA TTATATCTAT TCTGGCTACA ACTTTTACCA GCTATTTTTT ATATAGAATT
GGGGCAACGA TTGATCAAAA GCAAGGAGAA AAGATAGGAT TATTAGCAGG TGTTTTATAT
GCTATTTTTT CTTTACATAA TGATGGTGCT GCTGCCAATG CAGAAATTTT CTTTGCTCCT
TTTGTTACAG TAGGATTTTT ATTGTTATTT CGGAATAGAC AATTATCAAA TATTAAAGTA
TTTATGATAG GTCTAATTTT TGGGATAGGG ATGCAAATTA AATATTTGGT GATTATGGAT
ACCTTGGCAT TGGTTTTATT AGGGACTTGG TTTAGGAAAG AAAGAAAAAT AAAAGAAAAG
GAAAAAGAAG GGCTTATAAA AAAATTGAAT TCCACATTAA AGTTTTATCT AATTTTCGGT
ATAGGCTTAA TTTTACCTGC AATTTTTATT GCATTTATTT ATCAATTTTA TGGGTATTTT
GATGAATATA TCTATGCAAC TATAAGTGCT AATAGTAAGT ATGTAGCGAT GTTAGATTTT
TCTTTTTCAG ACCTTTTAAG TAGACTCAGA AAACAGGTAT TAGGAAATAT TTTGTTATGG
TTATGTTTGT TCTGGAGTCC GATTTATTTT TTTGTATTTG CTAGGGGTAA GTTTAAACAA
GAGCGCAATT TAATCTATTT ATTTTTGTGG TTTAGCTGTG CTTTTTTGGC AGTTTTGTTA
TCTAAGCGAT TTTATAATCA TTACTTTTTA CAATTATTAC CTCCATTATG TTTAATTAGT
GGATATATCA TTATTAAATC TGTTTTTTTA CCTCAAAATC TGGTAAATGA ACATCATTAT
GGAGAAGCAA TAAAAATGAA AAAAAATAGT ACTCAAGCTC AAATAAATGC TTTGATTAAT
TACCAATTAA ATGTAATTAT CCGCCCTTAT TTAGCTAATA TATTTTTGTT CTTTATTTTG
ATATATCCTT TTGCTCAAGC TGGTTACAAT AAGTTAAGTA AGAATTGGGA ATTTATCTAT
TATCGATACA TTCAAAAAAT AGATACGTGG GATGACAGAG AAGCTTTGAT TGCTAAATAT
TTGAGACAAA GAATAAAATC AAATGATTAT ATATATGTAG TTAATTATGA ACCCATAATT
TATTATTTAG TGCCCACAAA AGTTCCGACA AAATATGCTT TTCCTAGTCA TTTAACAGCT
ATGCATCAGA TTTTGCCTAC TAATTATCTA CAAGAATTAG ATAACATTAT GGCGAAAAAT
CCTAGTTATA TTTTACTTGC GGAAAAAGAT AATATTAGCC CTGAATATAG AAATGCTCTC
AATCAATATT TAGAGGCAAG TTTTTTTCTG GAAACTACAA TTAAAAATGT AAAATTATAT
CGGATAAATG TAAGTAGCAG TGATTAA
 
Protein sequence
MLKNINQKLI IFIGFLLLSF FLRFWTLFVS VLDKDESIYI LGADSLLNGN LPYTEIWDNK 
PPGIFILFSL AMLIFDRSIV SIRIISILAT TFTSYFLYRI GATIDQKQGE KIGLLAGVLY
AIFSLHNDGA AANAEIFFAP FVTVGFLLLF RNRQLSNIKV FMIGLIFGIG MQIKYLVIMD
TLALVLLGTW FRKERKIKEK EKEGLIKKLN STLKFYLIFG IGLILPAIFI AFIYQFYGYF
DEYIYATISA NSKYVAMLDF SFSDLLSRLR KQVLGNILLW LCLFWSPIYF FVFARGKFKQ
ERNLIYLFLW FSCAFLAVLL SKRFYNHYFL QLLPPLCLIS GYIIIKSVFL PQNLVNEHHY
GEAIKMKKNS TQAQINALIN YQLNVIIRPY LANIFLFFIL IYPFAQAGYN KLSKNWEFIY
YRYIQKIDTW DDREALIAKY LRQRIKSNDY IYVVNYEPII YYLVPTKVPT KYAFPSHLTA
MHQILPTNYL QELDNIMAKN PSYILLAEKD NISPEYRNAL NQYLEASFFL ETTIKNVKLY
RINVSSSD