Gene Phep_1963 details

Gene Information

Locus tag: Phep_1963
Symbol:
ID: 8253067
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 2265893
End bp: 2267113
Gene Length: 1221 bp
Protein Length: 406 aa
Translation table: 11
GC content: 35%
IMG OID: 644935614
Product: glycosyl transferase group 1
Protein accession: YP_003092233
Protein GI: 255531861
COG category: [M] Cell wall/membrane/envelope biogenesis
COG ID: [COG0438] Glycosyltransferase
TIGRFAM ID:
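
The coordinates and lengths in this record can be cross-checked against each other. The following is a minimal sketch in Python, assuming that Start bp and End bp are 1-based inclusive positions and that the CDS ends in a single stop codon; neither assumption is stated on this page, and the function names are illustrative only.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length in bp of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by a CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


length = gene_length(2265893, 2267113)
print(length)                  # 1221
print(protein_length(length))  # 406
```

Both results agree with the Gene Length and Protein Length fields, which is consistent with the inclusive-coordinate assumption.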


Plasmid Coverage information

Num covering plasmid clones: 13
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.0000383254
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGTCAAAAG CAATAAATGG TGTTTCTTCC GGTTTTATTA ACGACGTCAT CATTGTAGGG 
CAACAACCAT GGGATACCGA AATCGGAAGC AACTGCAAGG ATATAGCTTT TGAACTTAGT
AAAACAAATA GAGTTTTATA TGTGAATTCT GCAATTAACA GAATTACCCT GATCAAGGAA
AGAGACAATC CGAAAACAAA ACTACGTTTG GATGTTGTAC GGGGAGAGGA AAATGGCCTG
GTAAAAATAA CAGACAATTT ATGGACTTAC TATCCGGATT GCGTTGTTGA ATCTATCAAT
TGGATTAACC CTACTTTTAT TTTTAACATA TTAAATAAAC GCAATAACAG GAAATTCGCA
CATGCCATAA GCAAAGCGCT AAATACTTTA GGATTTAAAA ATCCCATACT GTTTAATGAC
AATGAAATGT TTCTCGGCTT TTATCTCAAG GAGTTTTTGC AGCCAAGGCT TAGTATTTAT
TACGCGCGAG ATTATATGAT TGCGGTTAAT TATTGGAAAA AACATGGAGC TAAATTGGAG
CCAATCCTGA TTGGCAAAAG CGATATTTGT TTTACAAACT CAGCCTATCT CGAAAGGTAC
TGCATGTTAT ATAACAACAG TTCTTTTAAT GTTGGACAAG GATGCAATAT CGAAGCATTT
AAGAATGTTC CTGATAAAAA GATCGAAGAG TTAAGTCATA TAGGCAAACC TCTTATAGGT
TATGTTGGTT CTTTGAGCAG TATTCGTTTA AATATACAAC TTCTAGAATT GATTGCAGTT
TCCTATCCTC AATACGTACT GGTTCTGGTA GGACCAGAAG ATAATGAATT TATACAGAGT
AATCTGCATA ACTTACCAAA CGTATTATTT CTTGGACAAA AACCTACCGA GGAACTGCCA
TTATATATTC AAATGTTCGA TATATGTATT AATCCGCAGG AGATTAATGA AGTGACTATA
GGAAATTATC CTAGAAAGAT AGATGAGTAC CTGGCCATGG GAAAACCGGT AGTTGCAACA
AGAACAATAA CAATGGATGT TTTTGAAGAT TATGTCTATT TAGCAAACAA CCAAAGTGAA
TTTATAGAAC TTATCAATAA GGCATTTGCC GAAGATGATG AAATTAAAAT AGAAACACGC
AAAGCTTTCG CCTTTACCCA TACATGGGAG AGGAGCGTAC AGGAGATGAG GGATAAAATA
GCTTTGGAAT TACAAAAATA A
 
Protein sequence
MSKAINGVSS GFINDVIIVG QQPWDTEIGS NCKDIAFELS KTNRVLYVNS AINRITLIKE 
RDNPKTKLRL DVVRGEENGL VKITDNLWTY YPDCVVESIN WINPTFIFNI LNKRNNRKFA
HAISKALNTL GFKNPILFND NEMFLGFYLK EFLQPRLSIY YARDYMIAVN YWKKHGAKLE
PILIGKSDIC FTNSAYLERY CMLYNNSSFN VGQGCNIEAF KNVPDKKIEE LSHIGKPLIG
YVGSLSSIRL NIQLLELIAV SYPQYVLVLV GPEDNEFIQS NLHNLPNVLF LGQKPTEELP
LYIQMFDICI NPQEINEVTI GNYPRKIDEY LAMGKPVVAT RTITMDVFED YVYLANNQSE
FIELINKAFA EDDEIKIETR KAFAFTHTWE RSVQEMRDKI ALELQK
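
The protein sequence can be reproduced from the gene sequence by translating it codon by codon. The following is a minimal sketch in Python, assuming the sequence is pasted with the 10-base grouping shown above. Translation table 11 assigns the same amino acids to codons as the standard genetic code (it differs in which start codons are permitted), so a single 64-codon table is used here. The helper names `clean`, `translate`, and `gc_fraction` are hypothetical and not part of any database tool.

```python
from itertools import product

# Codon table in the conventional TCAG ordering (third base varies fastest).
BASES = "TCAG"
AMINO_ACIDS = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
               "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used for display and uppercase."""
    return "".join(seq.split()).upper()


def translate(dna: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    dna = clean(dna)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE[dna[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    dna = clean(dna)
    return (dna.count("G") + dna.count("C")) / len(dna)


first_row = "ATGTCAAAAG CAATAAATGG TGTTTCTTCC GGTTTTATTA ACGACGTCAT CATTGTAGGG"
print(translate(first_row))  # MSKAINGVSSGFINDVIIVG
```

The output matches the first 20 residues of the protein sequence above. Passing the full gene sequence to `translate` and `gc_fraction` would allow the Protein Length and GC content fields to be checked in the same way.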