Gene Tery_3696 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagTery_3696 
Symbol 
ID4243871 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameTrichodesmium erythraeum IMS101 
KingdomBacteria 
Replicon accessionNC_008312 
Strand
Start bp5676807 
End bp5678192 
Gene Length1386 bp 
Protein Length461 aa 
Translation table11 
GC content41% 
IMG OID638108642 
ProductO-antigen polymerase 
Protein accessionYP_723229 
Protein GI113477168 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG3307] Lipid A core - O-antigen ligase and related enzymes 
TIGRFAM ID[TIGR00947] probable bicarbonate transporter, IctB family 


Plasmid Coverage information

Num covering plasmid clones19 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.312837 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAACTCAG TTTGGAAAAA GTTAACACTA ACTAATCTCT CATTCTCTGA CTCTGAATGG 
TTAAATGCAA GCTATCTCTA TGGTTTACTT AATGGTTCCC TCTATAACTG GCGACGTGGT
AGTTGGTTAA TGCAATGGGG AGAACCTCTT GGTTTTGTGT TGCTAGCAAT TGTATTTACT
CTAGCTCCTT TTGTAAATAC TACTCTCATT GGTTTCTTAT TACTTGCTAG CGCTGGTTTT
TGGGTATTGC TGAAGGTCTC GGATAACACC CAGGAATATT TAACTCCTAT TCATCTATTA
ATATTCCTCT ACTGGAGTAT TGCGACATTG GCAGTGGTGA TATCTCCGGC AAAGACTGCT
GCTTTTAGTG GCTGGGTAAA GTTGACTCTT TATTTATTGT TGTTTGCTTC GGGGTCTTTG
GTATTAAGAT CCCCTAGACT CCGCTCTTGG TTAATCAATA TTTATTTGTT GGTTTCTCTA
GTTGTTAGTT TTTATGGTAT TCGCCAATGG ATAGATAAGG TTGAACCTCT GGCTACCTGG
AATGATCCTA CTTCTGCTCA AGCAGGTGCG ACTCGTGTTT ATAGTTATTT GGGAAATCCT
AATTTATTGG GTGGATATTT GTTGCCTGCT ATTGCTTTGA GTTTTGTGGC AATTTTTGCT
TGGAGTAGTT GGGCTCGAAA ATCTCTGGCA GTAACAATAT TGCTGGTGAG TTGTGCTTGT
TTGCGTTATA CAGGTAGTCG AGGTAGTTGG ATTGGGTTTT TAGCTTTGAT GTTTGCTATG
TTGATTTTAA TGTGGTATTG GTGGAGGAGC TATATGCCCA GTTTTTGGCA AATTTGGTCT
CTGCCTATAG CTGTGGGTAG TTTTGCCGGG TTGTTGATTT TAGCGGTGGT GTTGTTAGAA
CCTTTGCGCG ATCGCGTCCT GAGTGTTTTT GCGGGTCGTC AAGATAGCAG TAATAATTTT
CGGATGAATG TTTGGATGTC TGTTTTTGAT ATGATTCGCG ATCGCCCTAT TTTGGGTATT
GGACCGGGTA ATGATGTGTT TAATAAGATT TATCCTCTCT ATCAGCGTCC CCGTTATAGT
GCTTTGAGTT CTTATTCTGT GCCTTTGGAA ATTGTTGTGG AAACTGGTTT TATTGGTTTG
ACTGCTTTTT TGTGGTTGCT TTTGGTGACT TTTAATCAGG GTGTATTGCA GTTGAAACGT
TTGCGAGATG CTGATAACCC TCAAGGATAT TGGTTAATTG GTGCGATAGC TGCTATGGTG
GGCTTGATAG GTCATGGTTT GGTGGATACG GTCTGGTATC GTCCCCAAGT TAATACTATT
TGGTGGTTGA TGGTGGCTAT TATTGCTAGT TATAGCAGTC AACAGGGGGT ACGGAGTAGG
GAATAG
 
Protein sequence
MNSVWKKLTL TNLSFSDSEW LNASYLYGLL NGSLYNWRRG SWLMQWGEPL GFVLLAIVFT 
LAPFVNTTLI GFLLLASAGF WVLLKVSDNT QEYLTPIHLL IFLYWSIATL AVVISPAKTA
AFSGWVKLTL YLLLFASGSL VLRSPRLRSW LINIYLLVSL VVSFYGIRQW IDKVEPLATW
NDPTSAQAGA TRVYSYLGNP NLLGGYLLPA IALSFVAIFA WSSWARKSLA VTILLVSCAC
LRYTGSRGSW IGFLALMFAM LILMWYWWRS YMPSFWQIWS LPIAVGSFAG LLILAVVLLE
PLRDRVLSVF AGRQDSSNNF RMNVWMSVFD MIRDRPILGI GPGNDVFNKI YPLYQRPRYS
ALSSYSVPLE IVVETGFIGL TAFLWLLLVT FNQGVLQLKR LRDADNPQGY WLIGAIAAMV
GLIGHGLVDT VWYRPQVNTI WWLMVAIIAS YSSQQGVRSR E