Gene Phep_2024 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_2024 
Symbol 
ID8253128 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2332965 
End bp2333966 
Gene Length1002 bp 
Protein Length333 aa 
Translation table11 
GC content39% 
IMG OID644935672 
Productglycosyl transferase family 2 
Protein accessionYP_003092291 
Protein GI255531919 
COG category[M] Cell wall/membrane/envelope biogenesis 
COG ID[COG1215] Glycosyltransferases, probably involved in cell wall biogenesis 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones12 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones13 
Fosmid unclonability p-value0.00394379 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGCTCGAGC GCACCCCGAA ATCACCACCT AAGATTGAGC CTCTATCTTT AAATGTCTCC 
AGGCCAAAAT GGTCAGTAAT GATTCCTTCT TATAATTGCA TCCATTATTT ACGTAAAACT
ATAGAAAGCG TTTTATTACA AGCCCCTTCT GCAGAAGAAA TGCAGATAGA AGTTATTGAT
GATTTCAGTA CTGACGGGGA TGTTGAGGCT CTGGTAAATG AAATTGGTAA AGGGCGAGTA
GGTTTTTATA AACAAACGCG GAATGTAGGT AGCCTGCGTA ATTTTGAAAC CTGTATCAAT
AGATCCATCG GTACATTGGT TCATATTTTG CATGGGGATG ATCTGGTTAA GCCTGGATTT
TATGAAGAAA TAGATGCTTT ATTTAAGAGC TATCCTGAAA TTGGGGCGGC ATTTACCGGT
TGTACAGATT TCGATGAAAA CGATAAAGAA ATTTGGGACA GTAAAATCAT CCTTCCTGAA
CCTGGAATAA TTGACAACTG GCTACTAAAA ATAGCGCAAG GGCAGCTGCT GCAAACCCCT
TGTATTGTAG TTAAACGTAC CGTATACGAA CATCTGGGTA GTTTTTTTGG TGTCCATTAC
GGAGAAGATT GGGAGATGTG GACACGAATT GCGGCTCACT ATCCGGTTGC TTATTCTCCT
AAGCCACTTG CATTTTACAG AGTTCACAAT AATAATATCA CCAGTAATTC TTTCCGAACA
GGGCAAAATA TAAAAGACAT TTCTGCGGTG ATCGACACCA TTCAAAACTA TCTTCCAATA
AAGGAAAGAA AAAAACTGAA AAGAAAAGCA AGGGAAAAAT ATGCCTACTA CATCACCATG
ATAGCCGATG GCCTTTACCA CAATAATACA GATTCAAAAC CGGCATTATT ACAAACGGCT
AAGGCCGTTC GGTTACATCC GAGCAAACAT ACCTTATATT ATTTTTTAAA GATCTCTGTT
AAAGTGTTGA TCCGTTATAA AGCTAAACAT CAAAGGAAAT AA
 
Protein sequence
MLERTPKSPP KIEPLSLNVS RPKWSVMIPS YNCIHYLRKT IESVLLQAPS AEEMQIEVID 
DFSTDGDVEA LVNEIGKGRV GFYKQTRNVG SLRNFETCIN RSIGTLVHIL HGDDLVKPGF
YEEIDALFKS YPEIGAAFTG CTDFDENDKE IWDSKIILPE PGIIDNWLLK IAQGQLLQTP
CIVVKRTVYE HLGSFFGVHY GEDWEMWTRI AAHYPVAYSP KPLAFYRVHN NNITSNSFRT
GQNIKDISAV IDTIQNYLPI KERKKLKRKA REKYAYYITM IADGLYHNNT DSKPALLQTA
KAVRLHPSKH TLYYFLKISV KVLIRYKAKH QRK