Gene Phep_0503 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0503 
Symbol 
ID8251590 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp604706 
End bp606001 
Gene Length1296 bp 
Protein Length431 aa 
Translation table11 
GC content45% 
IMG OID644934153 
Producthypothetical protein 
Protein accessionYP_003090789 
Protein GI255530417 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.00153136 
Plasmid hitchhikingYes 
Plasmid clonabilityhitchhiker 
 

Fosmid Coverage information

Num covering fosmid clones26 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAT ATACATTTAT AATGGCGGTA TTGGTTTTAT GTACCTTTTT CGCCTGCAAA 
GAGGAAATTG ACAAGCCGGG CGGTAAATGG TCTTCCGGAC CGATAGAGGT TGATCAGATC
ACTCCTATTA ACGGGGGAGC GACGATTACC TACAAACTCC CGAACGACCC CAACTTACTT
TACGTGATGG CAGAATATGA GCGGAACGGG AAAATGTTTA CCGAGAAATC CTCTGTTCAT
AAAAATATGC TTACGATCGA AGGATTTAAC ACAATGGGCA AAGTGCCATT CCGCCTGTAC
ATGGTTAATA AACAGGAACA GCGTTCAGAG CCTCTTGAAT TGGCGTTTGA GCCCCTGGAG
TCACTGATTA GCATTGCCAG GAATTCCTTG AATATGGTGC CGGGATTTGG CGGTATTGTT
GCGGATTGGA GTAATCCAAA AAAGACAGAA TTTGGTGTCC GGTTGATGGT TAAAGATGAA
AAAGGCGCTC TGGTAACCAA GGAAATGTAT TTTTCCTCGA GCGAAAAGGA ACGCCGTTCG
TTTCGGGGCT TTAATCCAAA AGAGACTACT TTTGCGATCG CTTTTGAAGA TAAGTGGGGA
AATATATCGG ACACGACATT GCTTAAGACA ACTCCATTTT TTGAGACGGT GGTACCGAAA
CCATATGCGG ATTTTCGGTC AAGTATCCCT TATGACAATG TCACAAATCT TTCTGGCAGA
AACATTAGTT CACTATGGGA CAACATTGTT AATACTGCTT CTCACGGATG GTTGACCCAA
TCCGGAGGCG GTAGCGGCAT ATCTATGACT ATTGATCTCA AACAAGTTGT CAAGCTCAGT
CGGATTGTTA TTCATGGTTA TCATGTCAAC TCCGTTTACG GACAAGCAAA TATTACTCAA
TTTGAGGCAT GGGGATTAAA AAAAATAGAT TTTGCCAAAT TAGCTGACCG TCCGTACTGG
CTGGACGAGT ATGCCGTTCG CAATAAGAAC ATCAGCGGCC TCGATGGGAT TACCGAAACC
ACCCAGCTTC CTGCCCGGAC ATTTAAAGAT GACTGGCAAT ATTTGGGTTG GCATGCCATA
CCTCGCTATG ACAGAATGGT TCCGCCAGAT CCCCAGGGCG CATTAAACCT TGCAGCAAAC
GGTACGGAAT ACGAAATTCC ACTGGAGGCC GGCCCCGTGC GATATATTCG GATATTTGTA
CGTGAAATTG CAGGCGCAAT GCCACCTCCC GTAAACAACT ATTGGTCTAT GGGAGAGATT
ACTGTTTATG GCGATAACAC AGTTCCACAA CAATAG
 
Protein sequence
MKKYTFIMAV LVLCTFFACK EEIDKPGGKW SSGPIEVDQI TPINGGATIT YKLPNDPNLL 
YVMAEYERNG KMFTEKSSVH KNMLTIEGFN TMGKVPFRLY MVNKQEQRSE PLELAFEPLE
SLISIARNSL NMVPGFGGIV ADWSNPKKTE FGVRLMVKDE KGALVTKEMY FSSSEKERRS
FRGFNPKETT FAIAFEDKWG NISDTTLLKT TPFFETVVPK PYADFRSSIP YDNVTNLSGR
NISSLWDNIV NTASHGWLTQ SGGGSGISMT IDLKQVVKLS RIVIHGYHVN SVYGQANITQ
FEAWGLKKID FAKLADRPYW LDEYAVRNKN ISGLDGITET TQLPARTFKD DWQYLGWHAI
PRYDRMVPPD PQGALNLAAN GTEYEIPLEA GPVRYIRIFV REIAGAMPPP VNNYWSMGEI
TVYGDNTVPQ Q