Gene Phep_3953 details

Gene Information

Locus tag: Phep_3953
Symbol:
ID: 8255087
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 4754365
End bp: 4756155
Gene Length: 1791 bp
Protein Length: 596 aa
Translation table: 11
GC content: 44%
IMG OID: 644937617
Product: hypothetical protein
Protein accession: YP_003094206
Protein GI: 255533834
COG category:
COG ID:
TIGRFAM ID:
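
The coordinate and length fields above are mutually consistent, which the following minimal Python sketch checks. It assumes the start and end coordinates are 1-based and inclusive (the record does not state the convention, but the numbers only agree under that reading) and that the CDS ends with a single stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information record.
START_BP = 4754365
END_BP = 4756155
GENE_LENGTH_BP = 1791
PROTEIN_LENGTH_AA = 596


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residue count of a complete CDS: number of codons minus one stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


print(gene_length(START_BP, END_BP))   # 1791, matches Gene Length
print(protein_length(GENE_LENGTH_BP))  # 596, matches Protein Length
```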


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.584499
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 23
Fosmid unclonability p-value: 0.585033
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGAAAAATC ATCTTCTACT TTCTGCCCTT ACGCTCCTGC TATTGTCTTT TGGCAGCTGT 
AAAAAAAATA ATACCGGTGC TCCGGAGCCC CCAACAACAG ATCCTGTAGT TTCGGATGCA
GATTTTGCGA AAAACTTTGG CGATGCGGCT TCCAGGAGCT TCATCGTTGA AGTTTTCGAT
GCAACAAATA ACCCTGTAAA TGGGGCTGAT GTTAAAATTG GCAGTATTAC TGCACAGACT
GATGCCAAAG GTGTAGCTGT GATTAAAAAT GCTGCTGTTT TTGAGCGATT TGCTTATGCA
ACGGTAAAGA AAGCCGGTTA CATAGACGGT TCAAGGGCCT TATCGCCAGC AGCAGGAACC
AGTAGCATTA AAATAAAGCT GTTTTCGGAT AAAGTTACGG CTACTGTAAA TTCCGGTGCA
GTTAGCAATG TGAGTTTGCC TAATGGTGTG AAGATTACAT TTGACGGCAA TTTTAAAACA
GAAACCGGAA CCCCATACAC CGGGCAGGTA AATGTAATTT TAAATGGTTT AGAGGCTTCG
GATCCGGATT TGTTTAGGAA AATGCCAGGA ATGCTGTTTG CCCGCAATGC TGCTGGTGAT
GCCAGATTGC TGGAAACTTA TGGAATGTTG AATGTGGAAC TGAGGGGAAG TGCGGGACAA
AAGTTGCAAA TTGCCAATAA GGCCCAGATA GAGATGAATA TTACCAGCGC CCAGTTGAGC
ACTGCAGCTG CAACCATGCC GCTTTGGCAT TTTGATGAGG CACTGGGTTA TTGGAAAGAG
GAGGGCCTGG CCAATAGGGT TGGAAATAAA TATGTTGGAG AGGTTGCCCA TTTCTCGTGG
TGGAATTTTG ATGCGCCAAT CCAGTCTCCA ATTGTACAGC TGCAGATTAA ACTTGTGGAT
GCGAATGCCC ATCCCTTGGC AAATGTTAAA ACGGTATTGT TCAGACAAGG TGGTTCTGCT
TCAATAGCTT CCGAAACAGC GGTGAACGGA ACTGCAAATG GCCCTGTGCC AGCCAACGAG
GTTTTTACTT TAAAAGCTTA TGATGCTTGT GGCAATGTGG TTTGTACGCA GAATATAGGC
CCTTTTACGG TAAATACGAC TTTGCCTGAT ATTGTGCTGA ATATGCCTGC AACGCAGTAT
ACTACGATTA GCGGTACTTT AAAAAAATGC GACAATACAA ATGTTACCAA TGGGTATGTT
GTAGTTAATT ATGGTCAGCA GACTTTTGCT ACCTTGGTGA TGAACGGTGC TTTCAGCTTT
CAGACTATAA TATGCGGGTC GAATGGGATG ACCATAATTG CTGAAGATGC GGACAGTCAT
CAGAATACTG GTACATTGAA TTATACTTTG AGCTCACCGG TAACCAATAT TGGCAATATA
CTGGCCTGTA ATACCAGTTC GGAATCTATT AGCTATAGCA TTGATGGGGG AGCACCGAAA
ATCATAACGG TAAATATAAA TGCCTCATCC TCTTCCGGTG CATTTACGAT CAGCGGCGGC
AGTCCGGTTC AAATGAATGA TATTTCTATC ACCGGTAATA CTTCAATGCC TGGCATTTAT
AGCTCTGCAT CGGGAATACA GATTATTGGC AACGGATTGA CTACCAGTCT GGGTTCGGCA
CTTAACCAGT TTGCGATCAC TTATAACTTG GCGAATGTAG GTGGCGTAGG ACAATATATT
GATGTTAGTT TTAGCGGAAC TTATCAGGAA GTGGTTATGA CAGGGATGAG CCAGGGATAC
AATTTATCGC ACACGATTTC GGGTACAGCG CATGTGTTAA GGGATAATTG A
 
Protein sequence
MKNHLLLSAL TLLLLSFGSC KKNNTGAPEP PTTDPVVSDA DFAKNFGDAA SRSFIVEVFD 
ATNNPVNGAD VKIGSITAQT DAKGVAVIKN AAVFERFAYA TVKKAGYIDG SRALSPAAGT
SSIKIKLFSD KVTATVNSGA VSNVSLPNGV KITFDGNFKT ETGTPYTGQV NVILNGLEAS
DPDLFRKMPG MLFARNAAGD ARLLETYGML NVELRGSAGQ KLQIANKAQI EMNITSAQLS
TAAATMPLWH FDEALGYWKE EGLANRVGNK YVGEVAHFSW WNFDAPIQSP IVQLQIKLVD
ANAHPLANVK TVLFRQGGSA SIASETAVNG TANGPVPANE VFTLKAYDAC GNVVCTQNIG
PFTVNTTLPD IVLNMPATQY TTISGTLKKC DNTNVTNGYV VVNYGQQTFA TLVMNGAFSF
QTIICGSNGM TIIAEDADSH QNTGTLNYTL SSPVTNIGNI LACNTSSESI SYSIDGGAPK
IITVNINASS SSGAFTISGG SPVQMNDISI TGNTSMPGIY SSASGIQIIG NGLTTSLGSA
LNQFAITYNL ANVGGVGQYI DVSFSGTYQE VVMTGMSQGY NLSHTISGTA HVLRDN
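
The sequences above are printed in blocks of ten characters, so they need whitespace stripped before any analysis. The sketch below shows one way to do that in Python, then compute GC content and translate the gene. It is an illustration only: the helper names are hypothetical, and the codon table is the standard one, which is an assumption that holds here because translation table 11 assigns the same amino acids to codons as the standard code and differs only in the permitted start codons.

```python
from itertools import product

# Standard codon-to-amino-acid assignments in NCBI order (T, C, A, G);
# translation table 11 uses the same assignments.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq_text: str) -> str:
    """Remove the spaces and line breaks used for ten-character blocks."""
    return "".join(seq_text.split()).upper()


def gc_content(seq_text: str) -> float:
    """Percentage of G and C bases in the sequence."""
    seq = clean(seq_text)
    return 100.0 * (seq.count("G") + seq.count("C")) / len(seq)


def translate(seq_text: str) -> str:
    """Translate from the first base, stopping at the first stop codon."""
    seq = clean(seq_text)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


# First three blocks of the gene sequence listed above.
first_blocks = "ATGAAAAATC ATCTTCTACT TTCTGCCCTT"
print(translate(first_blocks))  # MKNHLLLSAL, the first ten residues of the protein
```

Passing the full gene sequence to `gc_content` and `translate` should allow the GC content and protein sequence fields of the record to be compared against values computed directly from the nucleotides.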