Gene Ppha_1001 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPpha_1001 
Symbol 
ID6462480 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePelodictyon phaeoclathratiforme BU-1 
KingdomBacteria 
Replicon accessionNC_011060 
Strand
Start bp1035586 
End bp1036821 
Gene Length1236 bp 
Protein Length411 aa 
Translation table11 
GC content50% 
IMG OID642727252 
Productglycosyl transferase group 1 
Protein accessionYP_002017902 
Protein GI194336108 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones20 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clonesn/a 
Fosmid unclonability p-valuen/a 
Fosmid Hitchhikern/a 
Fosmid clonabilityn/a 
 

Sequence

Gene sequence
ATGACAACAA GAGCGATAGC CTATCTTTGC AGTGAGTATC CTGCAATTTC TCACACCTTT 
ATTTATAGAG AGATTGAGTC GCTCCGAAAA GCCGGAATGA GTGTCCACAC GGCATCAATC
CGGAAGCCAT CGAACCTTGA CGTCATGACT CCGGATGAAC AACAGGAGGC TGGTCGTACC
CTCATGGTAC TTTCCATGCC ATTACCCCAA ATGCTTGCCG CTCATCTGCA CTGCATAAAG
AAAAACCCGA CGGGATACCT GCGCATGGCC GCATCGGGGT TACGACTGCT GACCACTGGA
CCAAAAAGTC CGCTGAAGGC GGCGGCCTAT TTTGCTGAAG CGGGCATTCT CCTGCAGTGG
CTTCACCAAA ATGGCGTTAC CCACATTCAC GAACATTTTG CAAACCCCAC CGCCATTGTA
GCAATACTGA TGAAAACCTA TGGAGGAATA AGCTACAGCA TCTCCGTACA TGGCCCTGAT
ATCTTTTATA CCGTTGATTC CGCCATGCTT CCCGAAAAAA TCCGGAAAGC TGCCTTTGTC
CGCTGCATCA GCCACTACTG CAAAAGTCAG ATCATGCGCC TCAGTGAGCC ACAAGAGTGG
CATAAACTCC ACATTGTCCG CTGTGGAGTA GATCCTGATC TTTATACTCC AAGAGCTGAA
CCCGAGAACC CGGTACCGAA CATCCTCTGT GTTGGAAGGC TGGTACCAGC AAAAGGTCAA
TATATCCTGC TCGAAGCTTG TGGGCTCCTG AAAAAAGAGG GACTCGAATT TCATCTCACC
ATTGTTGGTG ATGGTCCCGA CAAGCTTATA CTTGAAAAGG CTGTCAACTC AGGCAACATG
AAAGGCTCGG CAACCTTTAC CGGTGCACTC GGACAGGATA AAGTGCGCGA CTATTATCAA
CGGGCCGATC TCTTTGTGCT TGCAAGCTTT GCCGAGGGTG TCCCTGTCGT GCTGATGGAA
GCCATGGCAA AAGAGATTCC GGTTATTTCA ACCCGAATCA CCGGAATACC CGAACTCATT
GAACATGAAA AAGATGGGCT GCTGGCAACA CCGGGTGACG CAGAAGATCT TGCCAACCAG
ATCCGCAAGT TGCTCACCAC CCCCCGACTA CGTCGAGAAC TTGGAGTTGC AGGCCGAAAA
AAAGTTGTCA CACTGTACAA CCAGCACACC AACAACCAAC AAATGGCAAC TCTGTTTCAT
GAAGAAGAAC TTGTATCGCA AAGAACTCGA ACGTGA
 
Protein sequence
MTTRAIAYLC SEYPAISHTF IYREIESLRK AGMSVHTASI RKPSNLDVMT PDEQQEAGRT 
LMVLSMPLPQ MLAAHLHCIK KNPTGYLRMA ASGLRLLTTG PKSPLKAAAY FAEAGILLQW
LHQNGVTHIH EHFANPTAIV AILMKTYGGI SYSISVHGPD IFYTVDSAML PEKIRKAAFV
RCISHYCKSQ IMRLSEPQEW HKLHIVRCGV DPDLYTPRAE PENPVPNILC VGRLVPAKGQ
YILLEACGLL KKEGLEFHLT IVGDGPDKLI LEKAVNSGNM KGSATFTGAL GQDKVRDYYQ
RADLFVLASF AEGVPVVLME AMAKEIPVIS TRITGIPELI EHEKDGLLAT PGDAEDLANQ
IRKLLTTPRL RRELGVAGRK KVVTLYNQHT NNQQMATLFH EEELVSQRTR T