Gene Phep_1840 details


Gene Information

Locus tag: Phep_1840
Symbol:
ID: 8252943
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 2130175
End bp: 2131791
Gene Length: 1617 bp
Protein Length: 538 aa
Translation table: 11
GC content: 45%
IMG OID: 644935490
Product: hypothetical protein
Protein accession: YP_003092110
Protein GI: 255531738
COG category:
COG ID:
TIGRFAM ID:
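
The Start bp, End bp, Gene Length and Protein Length fields above are mutually consistent, and the arithmetic can be checked with a minimal sketch. It assumes the coordinates are 1-based and inclusive at both ends, and that the CDS ends in a single untranslated stop codon; the record states neither convention explicitly. The variable names are illustrative only.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 2130175
end_bp = 2131791

# Assumption: coordinates are 1-based and inclusive on both ends.
gene_length = end_bp - start_bp + 1

# Assumption: the final codon is a stop codon and is not translated.
codon_count = gene_length // 3
protein_length = codon_count - 1

print(gene_length, protein_length)  # prints: 1617 538
```

Under these assumptions the result matches the reported 1617 bp and 538 aa.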


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.511064
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones:
Fosmid unclonability p-value: 0.00000982396
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAAGCATA TCGATTCGCG AATTCCACAG GTAAAAATGG CTTTAATGCT ATTTTTACTA 
TTAACCAGCA CATTTGTACA AGCCAAAGCA TCCAAAAAGG ATAAAGATCC AGGAAAAAGT
GAATGGATTT ATTTTGGGGC TGACGGCAAG CTGGTTTATA AAAAAAGTGC CAAGGGCGAC
CGCATAATGG ATTTCTCTCA TGCTGGCTAT ATGGGCGGTG GTGTGGCATT ACCAGATGTT
ATTGTAAAAG TTACGGTTAA GCCACCGGGC GATGTGAATG CAGATTGTAC AGCCTTAATT
CAGGCTGCTA TCGACAAGGT ATCGGCACTG CCATTGGATA AAAATGGTTT CCGCGGGGCT
GTTTTACTGG CACCTGGTAC CTACCCTTGT GCCAAAACCA TAAAAATAAC AGCAGATGGT
GTGGTACTCC GCGGAAGCGG CAAATCGGAA AACGGAAGTA TTATTGCCAT GAATGGTGAA
AAACATACTG CTGTAATCCT ATCCAACGGC CTTAATCAAC GTGCCGGCAA TCGACTGGGT
AATGCTGCAG GCAATGAAAA AACGGTTAAA ATAACAGATA AATATATCCC TGCCGGAAGC
CTGTCTTTTA ATGTAGCAAA TGCAGCTGGA TTTAAGGTTG GTGATAATGT TGAGATCAGA
AAACCGGTTA CCGACAAGTG GGTAAGCTTC ATGCACATGG ACGATCTGGT ACGGGATGGC
AAGGCGCAAA CATGGATTAA AACAGGCTCT TTATTGATCA CCGAACGTCA CATATCAGCT
ATAAACGGCA ATAAAATTAC TTTAGACGTT CCTTTGGTAG ATTCGTATGA TGTAAATTAT
ACCAATGATG AAACCACAAT GGTATTGGCC AATGATGTAA AACGGCTTAA ACAGGCTGGC
TTAGAAAATT TACGCATTGT ATCTCCCCCG CAAGCAGTTA ACCATACAAA GGCACTCTAC
TATGCGGTTA GGCTAAACGG CGAGGATTGT TGGATGAAAG ATCTGGACCT GATGGAAACC
ATGGAAAGCG TAGGAACCGG CGGACGCAGG ATTACACTGC AGCGCATCGC TGTCATCCGT
AAGGCCTTAC ATGATGGTGC ATCTAAACCG GCAGAATTTG CACCCAATGC AGGGCAAATT
CTGGTAGATC GCTGTTCGGT TGAAGGTGAT AACATCTGGT ATGTAGCACT GGGCGCCGGA
CAAACAGGCC CCATCGTTTT TTTGAATTGT AATTTTGTTG GCAATGGCCG CATTGAAGGA
CATCAGCGTT GGAGCACAGG CATGCTTTTA GACAATTGCA AGGCACCAAA TGGTGGTATG
GACTTCAAAA ACAGGGGCAG TATGGGCTCT GGTCATGGCT GGGGAACTGC CTGGTCTGTT
GCCTGGAATT GTGAAGCAGG CAGCTATGTA AATCAGATAC CTCCGGGCAC CTACAATTGG
GTAATTGGCA GTAAAGGAAA AAGTATGCCT TTGCCAAGAC CATTTAACAA TGCTGGCCCG
GCTTTGCCGG AGGGTATTTT TGATGCTCAA AACACCCGGG TCAATCCTTC AAGTTTATAT
CTGGCCCAAC TGGAAGAACG TTTGGGTAAA CAGGCCCTGA AAGCAATAGG GTATTAA
 
Protein sequence
MKHIDSRIPQ VKMALMLFLL LTSTFVQAKA SKKDKDPGKS EWIYFGADGK LVYKKSAKGD 
RIMDFSHAGY MGGGVALPDV IVKVTVKPPG DVNADCTALI QAAIDKVSAL PLDKNGFRGA
VLLAPGTYPC AKTIKITADG VVLRGSGKSE NGSIIAMNGE KHTAVILSNG LNQRAGNRLG
NAAGNEKTVK ITDKYIPAGS LSFNVANAAG FKVGDNVEIR KPVTDKWVSF MHMDDLVRDG
KAQTWIKTGS LLITERHISA INGNKITLDV PLVDSYDVNY TNDETTMVLA NDVKRLKQAG
LENLRIVSPP QAVNHTKALY YAVRLNGEDC WMKDLDLMET MESVGTGGRR ITLQRIAVIR
KALHDGASKP AEFAPNAGQI LVDRCSVEGD NIWYVALGAG QTGPIVFLNC NFVGNGRIEG
HQRWSTGMLL DNCKAPNGGM DFKNRGSMGS GHGWGTAWSV AWNCEAGSYV NQIPPGTYNW
VIGSKGKSMP LPRPFNNAGP ALPEGIFDAQ NTRVNPSSLY LAQLEERLGK QALKAIGY
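
The sequences above are printed in blocks of ten characters separated by spaces, so they need cleaning before any computation. The following is a rough sketch, not the database's own method, of how the gene sequence could be cleaned, its GC fraction computed, and its translation checked against the protein sequence. The helper names `clean`, `gc_fraction` and `translate` are hypothetical. The codon table holds the amino-acid assignments of translation table 11, which for the 64 codons are the same as the standard code; the sketch does not handle alternative start codons, which is harmless here because the gene begins with ATG.

```python
from itertools import product

# Amino-acid assignments in TCAG codon order (TTT, TTC, TTA, TTG, TCT, ...).
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the block spacing and line breaks used on the page."""
    return "".join(seq.split()).upper()


def gc_fraction(seq: str) -> float:
    """Fraction of G and C bases in the cleaned sequence."""
    dna = clean(seq)
    return (dna.count("G") + dna.count("C")) / len(dna)


def translate(seq: str) -> str:
    """Translate codon by codon, stopping at the first stop codon."""
    dna = clean(seq)
    protein = []
    for i in range(0, len(dna) - 2, 3):
        aa = CODON_TABLE.get(dna[i:i + 3], "X")  # "X" marks an unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First line of the gene sequence, copied from the record.
first_line = "ATGAAGCATA TCGATTCGCG AATTCCACAG GTAAAAATGG CTTTAATGCT ATTTTTACTA"
print(translate(first_line))  # prints: MKHIDSRIPQVKMALMLFLL
```

The printed residues match the first twenty residues of the protein sequence. Passing the full gene sequence to `gc_fraction` and `translate` would allow the same comparison against the reported GC content and the complete protein.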