Gene Phep_2021 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2021 
Symbol 
ID8253125 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2328702 
End bp2329868 
Gene Length1167 bp 
Protein Length388 aa 
Translation table11 
GC content42% 
IMG OID644935669 
Productglycosyl transferase group 1 
Protein accessionYP_003092288 
Protein GI255531916 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG0438] Glycosyltransferase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones15 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones12 
Fosmid unclonability p-value0.0022628 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAGAGGA TAGTTATAGC CAGTACCTGC GCAGAGGACT GGGGTGGCAG CGAAGAATTA 
TGGGGAAGAA GTGTTCCGCT GTTACAAGAG TCTGGTTTTC ATATTACAGT AATAAAATAT
TATATCAACA GGGCTCATCC GGAATTTATC AGACTTGCTG AAAGGGGCGT TAATCTGCTC
GACATTTTCC CAAAAGGTAC AATAGCAAAA AGAGTTTATA AAAAGAGCTT AAAATTAGTC
AATCAGGTGG CGCTGAAACT GAAATTGACC TCAGATCAGG GTGAGGATTT TAGCGCTTTC
ATCAAGATCA TGCAGGACAC CAAACCAGCA CTGGTGATCA TCTCACAGGG AATAAATTTT
GACGGCCTGA AACTTGCTTA CCAATGTTCC CTCCTTAAAA TACCCTATGT GGTCATTTCC
CAAAAGGCCG TAGATTTTTA CTGGCCCCAC AAGGATGATC GTGCCTTTAT GCTCAAAGCA
CTGGAAAAAG CGGAAAAATG TTTCTTTGTA TCACACCATA ACCTGCGGTT AACAGAAGAA
CAATTTGGTA AAAGACTACC CAACGGGCAG GTCATTTTTA ATCCGGTAAA GCTATCAGGA
AACATTGTAC CATTTCCTAA ATCTAAAGAA CCATACAAAT TAGCTTGTTT AGGCAGGCTT
TTTTTGCTGG ATAAAGGACA GGATCTATTG ATCCGTATAC TTTCCGAGCA AAAATGGAGA
GATCGTCCTG TAAAAGTATC CTTTATTGGA AAAGGTACAG ACGAGGCAGC TTTAAAAGAT
ATGGCTAAAC TGTTAAATGT CACCAACGTC GATTTTCTGG GACAGATTGA GGATATTGAA
GCGATGTGGG AAGATTATCA TGCCCTTGTT CTTCCTTCCA GAAGTGAAGG TTTACCTCTA
TCTATGGTCG AGGCCATGTC TGCTGGCAGG CCCGTAATCA TATCCAATGC AGGCGGGAAT
GCAGAACTGG TAGAAGAAGG TGTTACCGCT TTTATCGGTC ACGCCAATGA AGAATCATTC
GGGGAGGCAA TGGAACGTGC CTGGCATAAA AGAGAAGAAT GGGAAGAGAT CGGAAAAAAC
GGGGCTAAAC ATGTTGCAGA AAACGTCCCA AAATCACCTG AAACAGAGTT CGCAAAGCTA
ATTGTTGAGC TTCTTGCAAA TAAATAA
 
Protein sequence
MKRIVIASTC AEDWGGSEEL WGRSVPLLQE SGFHITVIKY YINRAHPEFI RLAERGVNLL 
DIFPKGTIAK RVYKKSLKLV NQVALKLKLT SDQGEDFSAF IKIMQDTKPA LVIISQGINF
DGLKLAYQCS LLKIPYVVIS QKAVDFYWPH KDDRAFMLKA LEKAEKCFFV SHHNLRLTEE
QFGKRLPNGQ VIFNPVKLSG NIVPFPKSKE PYKLACLGRL FLLDKGQDLL IRILSEQKWR
DRPVKVSFIG KGTDEAALKD MAKLLNVTNV DFLGQIEDIE AMWEDYHALV LPSRSEGLPL
SMVEAMSAGR PVIISNAGGN AELVEEGVTA FIGHANEESF GEAMERAWHK REEWEEIGKN
GAKHVAENVP KSPETEFAKL IVELLANK