Gene Phep_1839 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1839 
Symbol 
ID8252942 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2128556 
End bp2130163 
Gene Length1608 bp 
Protein Length535 aa 
Translation table11 
GC content46% 
IMG OID644935489 
ProductAlpha-galactosidase 
Protein accessionYP_003092109 
Protein GI255531737 
COG category[G] Carbohydrate transport and metabolism 
COG ID[COG3345] Alpha-galactosidase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones11 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones
Fosmid unclonability p-value0.00000186405 
Fosmid HitchhikerYes 
Fosmid clonabilityhitchhiker 
 

Sequence

Gene sequence
ATGAAACAAT TTATAACAGG CTGCTTACTT TCCTTTGGCA TTTTATGTCT GACCAATCAC 
CTGGTAAAGG CGCAGGGCGC TCCGGATACC TTAAAAAAAT ATATCCTTAC GCCTGCCCCA
CCCCAAACTC CGCGCATCAA CGGGGCCAGA ATATTTGGGC TCCGTCCAGG CTCTGCCTTC
CTTTATACCA TCCCTGCAAC AGGCATTCGC CCGATGCACT TCGGAGCTTT GAATTTGCCA
AAAGGTTTAA CCGTAGACCC CGGTTCTGGC CGGATAACGG GAAAAATAAC AGAACGCGGG
GAATATGAAG TAACCCTGAC CGCAAAAAAT TCATTAGGAG AATCTAAACG GACATTTAAG
ATAGTAGTGG GTGATCAGAT AGCCCTAACA CCTCCAATGG GCTGGAATAG CTGGAATTGC
TGGGGCGATG CCGTAAGCCA GGAAAAGGTA TTGAGTTCGG CCAAAGCAAT GGTAGAAAAA
GGCCTGCTGA ATTATGGCTG GCAATACATC AATATAGACG ATGGCTGGCA GGGACTTCGT
GGTGGAAAAT ACAATGCCAT TCAATGTAAC AGCAAATTTC CTGATATGAA GGGTCTTGCC
GATGAAGTAC ACAGGATGGG ACTTAAAATA GGAATTTACT CTGGTCCCTG GGTAGGAACC
TATGCCGGGC ATCTCGGGGC TTATTCTGAC AATGCCGATG GTACGTACGA CTGGGTGAAA
CAAGGGAAAC ACAATGAATT TTACCGTTTT GCTGATCCTG AGAAAAAGGA AAAGCATGGC
ATAAACTACC ACCACGGCAA ATATTCATTT GTGAAAAATG ACGTACAGCA ATGGATGGAC
TGGGGAATGG ATTACCTGAA ATACGATTGG AACCCCAACG ATGTATACCA TGTAAAAGAA
ATGAAGGACG CATTACGTTC TTATAAACGG GATGTAGTAT ACAGTTTGTC TAACAGTGCC
CCTTACGGAG ATGCCACACA ATGGGAAAAA ATGGCCAATA GCTGGAGGAC TACCGGTGAT
ATCAGAGACA CCTGGGAGCG GATGTGCCAG CTTGGCTTTA ATCAAACCAA ATGGGCCCCT
TTTGCCGGTC CCGGACATTG GATAGACCCG GATATGCTGG TAGTAGGGAT GGTAGGCTGG
GGACCTAAAC TACATTATAC AAAGCTAACT GCTGATGAAC AATACACGCA CATCAGTTTA
TGGTGTTTAC TCGCTTCTCC CCTGTTAATT GGCTGTGATA TGGCCCAGCT GGATGACTTC
ACCATCAGTT TGCTAACCAA CAACGAGGTG ATTGATGTAA ACCAGGATCC AATGGGCAAG
TTTGGTATGC TGGTCGCTGA AAATGGGGAA ACAGTGGTAT ATGCCAAACC GCTGGAGGAT
GGTTCAATGG CTGTTGGTCT GTTTAACCGT GGACAAAAAT CAGAAAAGAT CACTGTCAAC
TGGAAAACCC TGGGATTAAG GGGCGAACAA ACGGTTCGTG ATCTATGGAG ACAGCAGGAC
GTTGCCAAAT CCGATCAGGA ATTTTCATCA GAAGTGAACC CGCATGGTGT CCGTTTTATA
AAAGTATATC CTGGAAACAG CAGAACACAG GCAACTTCCG GAAAATAA
 
Protein sequence
MKQFITGCLL SFGILCLTNH LVKAQGAPDT LKKYILTPAP PQTPRINGAR IFGLRPGSAF 
LYTIPATGIR PMHFGALNLP KGLTVDPGSG RITGKITERG EYEVTLTAKN SLGESKRTFK
IVVGDQIALT PPMGWNSWNC WGDAVSQEKV LSSAKAMVEK GLLNYGWQYI NIDDGWQGLR
GGKYNAIQCN SKFPDMKGLA DEVHRMGLKI GIYSGPWVGT YAGHLGAYSD NADGTYDWVK
QGKHNEFYRF ADPEKKEKHG INYHHGKYSF VKNDVQQWMD WGMDYLKYDW NPNDVYHVKE
MKDALRSYKR DVVYSLSNSA PYGDATQWEK MANSWRTTGD IRDTWERMCQ LGFNQTKWAP
FAGPGHWIDP DMLVVGMVGW GPKLHYTKLT ADEQYTHISL WCLLASPLLI GCDMAQLDDF
TISLLTNNEV IDVNQDPMGK FGMLVAENGE TVVYAKPLED GSMAVGLFNR GQKSEKITVN
WKTLGLRGEQ TVRDLWRQQD VAKSDQEFSS EVNPHGVRFI KVYPGNSRTQ ATSGK