Gene Phep_0402 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0402 
Symbol 
ID8251487 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp474184 
End bp475263 
Gene Length1080 bp 
Protein Length359 aa 
Translation table11 
GC content38% 
IMG OID644934050 
Productglycosyl transferase family 2 
Protein accessionYP_003090688 
Protein GI255530316 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones21 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones20 
Fosmid unclonability p-value0.20489 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGTGGTGA TCACCTTCAT AAGAGGATGG CATAAACTGA TAAATTTTAA ACCGCAACCA 
GTAATTTTAA CCACAAAAGT TTCTGTACTT GTTGCTGCCC GTAATGAGGC CCCGAACATT
GCTAAAACAA TAGAAGATCT GATTGCCCAA AATTACGCCA GGGATTTAAC GGAAATCATA
TTTATTGACG ATCACTCTAC CGACCAGACT GCTGCCATTA TCAGTTCCTA TTCTGATAAA
GGAGTGAAAC TCATTCGTCT GAATGAAGAT CAGCCCCTTA ATTCCTATAA AAAGAAAGCT
ATACAAACTG CTATTGCCCA GGCTAGAGGT AGTTTGATCA TTACAACTGA TGCAGATTGC
AGAATGGGAC CAGATTGGCT TAAAACCATA GTTAGCTTTT ATGAGGAGAA GAAATATAAG
ATGATCTCCT CTCCGGTTGC TTATTTTGAA GAAAAAAGTT TTTTTGAAAA GGCACAAACG
CTTGAATTTT TGTATTTAAT TGGTTTGGGT GCTTCAACTA TAGGCAATAG AAAACCATCA
ACCTGTAATG GGGCCAACTT AGCTTACGAA CGGGAAGCTT TTTACGAGGT AGGTGGTTTT
AAAGGAATTG ACGATCTTGC CTCAGGGGAT GATGAGCTGC TGTTACATAA AATGGCCGCA
AAGTTTAACG GCCATATCGG ATTCTTAAAA AATGAGGATG CTGTAGTATA CACACATCCA
AAAGCCACAT TAAAGGAATT CATTCAACAG CGTAAACGAT GGGCATCAAA AAGTACCCGA
TATAAAAACA AATCGATCAT CGTATTGGGC GTTTGCATCT GGCTATTTAA TTTAAGTATT
CTTTTAAATG CAGCTTCAGG TATTTTTAAC TGTTTTTATT TTAAGCTGGC AGTGGTTCAG
CTATTGGCGA AAATGACGAT TGAATTGTTA TTTTTATACG ATGTAACCGC CTTTGCCAAA
AGACGGAACC TGCTGATATT ATTGCCTGTA CTTAATGTAT TGCATATTTT ATACATTGTA
TACATAGGCG TTGCCGGAAA TACCGGAAAG TATAACTGGA AAGATAGAAT GGTAAAATAA
 
Protein sequence
MVVITFIRGW HKLINFKPQP VILTTKVSVL VAARNEAPNI AKTIEDLIAQ NYARDLTEII 
FIDDHSTDQT AAIISSYSDK GVKLIRLNED QPLNSYKKKA IQTAIAQARG SLIITTDADC
RMGPDWLKTI VSFYEEKKYK MISSPVAYFE EKSFFEKAQT LEFLYLIGLG ASTIGNRKPS
TCNGANLAYE REAFYEVGGF KGIDDLASGD DELLLHKMAA KFNGHIGFLK NEDAVVYTHP
KATLKEFIQQ RKRWASKSTR YKNKSIIVLG VCIWLFNLSI LLNAASGIFN CFYFKLAVVQ
LLAKMTIELL FLYDVTAFAK RRNLLILLPV LNVLHILYIV YIGVAGNTGK YNWKDRMVK