Gene Paes_1994 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPaes_1994 
Symbol 
ID6459867 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism nameProsthecochloris aestuarii DSM 271 
KingdomBacteria 
Replicon accessionNC_011059 
Strand
Start bp2192090 
End bp2193241 
Gene Length1152 bp 
Protein Length383 aa 
Translation table11 
GC content53% 
IMG OID642725979 
Productglycosyl transferase family 2 
Protein accessionYP_002016653 
Protein GI194334793 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones39 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.713499 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGATGATCT ATCAACTCTC AATTCTGTTC TGCCTTCTGG TTTTTTTTGT GATTGTTCTC 
CTGAACCTGA AGGATTTCAA AAAGCTACCC TCGCGCGCTA CCAGCGTATC TCCTCTGGTT
TCGGTGCTCG TTCCCGCGCG TAATGAAGAC AACACCATAG CCCGGTGCAT CGAGTCGCTC
CTGATTCAGG ATTACGGTAA TTTCGAAATC ATTGTGCTCA ATGACGGCTC TACCGACCGT
ACTGCTGAGG TGCTGCAGTC GATAGTCAGC TCAGTTCAAG GGGCCGCATT GCGTGTGATT
GACGGAACGA CGCTGCCTGA TGGCTGGCAC GGCAAGGCCT GGGCCTGTCA GCAGCTCGGT
GCAGAAGCCA GGGGAGAACT GCTGCTGTTT ACCGATGCCG ATACCGTTCA TGCTCCCGAC
AGTGTTCGGC GGGCCGTTGC GGCTCTGGAC GAGAGCCGGG CCGATATGCT CTCTCTGACA
CCCTATCAGG AGACGAAGTC GTTCTGGGAG CGCCTGATTA TTCCTGTGAT GTACGTTATC
GTGTTCTGCT ACCTTCCTTT GCGCATGGTC CGGGAACATG CTTCCGAAGC GTTCTGCTTC
GCCAACGGTC AGTTTATCAT GATGCAGAGA AAGATGTACG ATCTCATCAA CGGCCACAGC
GCAGTCAGAC GCAATATTGT CGAGGATGTC TGGTTGTGCA AAGCGGTGAA GAGAGCCGGA
GGCAGCGTTG CGGTCTACAA TGGTACCGAC ACGGTGCGTT GCAGAATGTA TCGGACTCTT
TCGGAGATAT GGCAGGGGTT TTCCAAAAAC CTCTTTGCAG GGCTTGGATA CAACACTATC
GGACTCTTTA GCCTGATTGT CATGACGGCA TTATTCTACA TCGTACCCTA TATTTTTGTT
TTTCGGGCAC TGCTTCTGCA GGATTATTCG TTCTTCGTTT TCTGGCTGCC TTGCCTGCAG
ATTGCTCTTG CCATGCTCAT GCGGCTCTTC ATTGCAGTAC GATTCCGTCA GCCGCTCAGC
GGTGCACTGC TGCACGGTTT GTCGCAGCTG ATGCTCATCG CTCTGGCCGC AAATTCGTTT
TACCTCGTCA GATTCGGCGG CGGAGCGCGA TGGAAGGGGC GCCAATATGA TTTTTCAGAC
CATCAATCCT GA
 
Protein sequence
MMIYQLSILF CLLVFFVIVL LNLKDFKKLP SRATSVSPLV SVLVPARNED NTIARCIESL 
LIQDYGNFEI IVLNDGSTDR TAEVLQSIVS SVQGAALRVI DGTTLPDGWH GKAWACQQLG
AEARGELLLF TDADTVHAPD SVRRAVAALD ESRADMLSLT PYQETKSFWE RLIIPVMYVI
VFCYLPLRMV REHASEAFCF ANGQFIMMQR KMYDLINGHS AVRRNIVEDV WLCKAVKRAG
GSVAVYNGTD TVRCRMYRTL SEIWQGFSKN LFAGLGYNTI GLFSLIVMTA LFYIVPYIFV
FRALLLQDYS FFVFWLPCLQ IALAMLMRLF IAVRFRQPLS GALLHGLSQL MLIALAANSF
YLVRFGGGAR WKGRQYDFSD HQS