Gene Tery_3477 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3477 
Symbol 
ID4244477 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5355116 
End bp5356810 
Gene Length1695 bp 
Protein Length564 aa 
Translation table11 
GC content36% 
IMG OID638108451 
Productglycosyl transferase family protein 
Protein accessionYP_723040 
Protein GI113476979 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1807] 4-amino-4-deoxy-L-arabinose transferase and related glycosyltransferases of PMT family 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones16 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGCTTAACA AAATATTAAC AAAGAAAAAC TTACATTCTC ATATAGATTT ACTGCTAACG 
CTAGGCTTAT TGCTAGCAGC AGTAGTACTT TTTTCTACTA ATTTAGGAAC ATTGCCTTTA
CGAGACTGGG ATGAAGGAAT AGTTGCCCAA GTAGCCAGAG AAATCAGCCG AGACAAGTGG
AATTGGCTTT ATCCTACTAT CAATAATACT CCTTATTTTA ATAAACCACC ATTGATACAT
TGGTTGATCG CTTTTATCTA TAGTATTGCA GGAGTCAATG AATTCAATGC CCGTTTATTT
CCCGCTTTGT TAACAGCATT TTCTGTACCT TTAATCTATG GTATTAGTCG AGAATTATTT
CACCCACGCA CTCCGGCAAT TTTTACCGCT TTGGTTTACT TAACTTTACT CCCTGTAGTG
CGACATGGGC GTTTAGCAAT GTTAGATGGT GCAGTAGTTT GTTTTTTTTT ATTAATGATT
TGGTGCGTAT TGCGCTCACG GCAAAACCTC CGCTATTCTC TCCCTATTGG TATTAGTTTT
GCTTTAGTTT CCCTCAGCAA AGGAATTATT TTAGGATTAC TTTTAGGAGC GATCGCTTTT
TTGTTTCTCT GGTGGGATAC TCCCCGACTA TTAAGCAATA AATATTTTTG GAGTGGTATC
CTATTAGGTA TGTTACCAGT TTTTCTATGG TACACAGCTC AATTTTTTCA TTACGGCGTT
GAATTTTTCT ATGCCAACTT TTTCCATCAA TCTTTAAAAC GTATTTGGCA ACAAGTAGGT
AATCATGATG GACCTATTTG GTATTATTTA TTAGAAATTA TTAAGTATAG TTTTCCATGG
CAGCTATTCT GGTTACCAGG ATTATATTTG AGTTGGAAAA ATCGTAGTCT GAGTTGGGGT
AAATTAGTTT TGATTTGGAC TGGAGTTTAT CTGTTTGCTA TTTCATTAAT GAATACAAAA
CTTCCTTGGT ATGTGTTACC TATTTATCCA GCTTTTGCCT TAGCAGTAGG TAGTTATATA
ACAGAAATCT GGGATCAATT TCCATTAGAC TTAGGATGCT TTTGGTTTTC AGGTGATAAA
TATCTTCCAA CTCACGGAGA TCATAAAAAA CATCTCTGGA GTCTTCCCAC TATTTACCAT
CGTTTAGTAG TAGCTTTATT TGCACTACTA GCAATAATAG CTTGGGCTGC TTCTGTTTAC
TTTAGTGGGA TTTTTGATCT AGGGGAGCAA AATTTAGCAA AACCTAACTT ACAGTTACAG
TTAGTTGCTG TGGCTCTTGC ATTGACAATG ACAATGGTAA CTCTACTGTT AAATAAACAA
CAGCATCAAT TTTTATTAAT TTTGATTTGG GGAACATATC TTTCACTGCT AATGTTTGTT
AGCTCTCCTT ATTGGATATG GGAATTGGAA GAAAATTATC CAGTCAAACC AGTCGCAGAA
ATGATTCAGA AAGATACCCC CCCGGGTCAG GTTATTTATT CTTTTGACAC TAAAGACCGT
CCCTCTTTAA ATTTTTATAG CGATCGCCTC ATTAAGCGTG TTGGCCCAAA AAAAATTCAA
CAGCAATGGC AAAAAACAAC TCAACCCTAT TTATTAGTTG AAGCATTAAC TCTAAATAAT
CTCCCCCTAG AAAATTTTCA GGTTTTGAAT ACTGTCAAAG GATGGTCCTT AGTTACAAGA
GAAGGTAAAA GATAA
 
Protein sequence
MLNKILTKKN LHSHIDLLLT LGLLLAAVVL FSTNLGTLPL RDWDEGIVAQ VAREISRDKW 
NWLYPTINNT PYFNKPPLIH WLIAFIYSIA GVNEFNARLF PALLTAFSVP LIYGISRELF
HPRTPAIFTA LVYLTLLPVV RHGRLAMLDG AVVCFFLLMI WCVLRSRQNL RYSLPIGISF
ALVSLSKGII LGLLLGAIAF LFLWWDTPRL LSNKYFWSGI LLGMLPVFLW YTAQFFHYGV
EFFYANFFHQ SLKRIWQQVG NHDGPIWYYL LEIIKYSFPW QLFWLPGLYL SWKNRSLSWG
KLVLIWTGVY LFAISLMNTK LPWYVLPIYP AFALAVGSYI TEIWDQFPLD LGCFWFSGDK
YLPTHGDHKK HLWSLPTIYH RLVVALFALL AIIAWAASVY FSGIFDLGEQ NLAKPNLQLQ
LVAVALALTM TMVTLLLNKQ QHQFLLILIW GTYLSLLMFV SSPYWIWELE ENYPVKPVAE
MIQKDTPPGQ VIYSFDTKDR PSLNFYSDRL IKRVGPKKIQ QQWQKTTQPY LLVEALTLNN
LPLENFQVLN TVKGWSLVTR EGKR