Gene Phep_1459 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1459 
Symbol 
ID8252560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp1733715 
End bp1735595 
Gene Length1881 bp 
Protein Length626 aa 
Translation table11 
GC content46% 
IMG OID644935113 
ProductHeparinase II/III family protein 
Protein accessionYP_003091735 
Protein GI255531363 
COG category 
COG ID 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value0.230111 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones23 
Fosmid unclonability p-value0.696506 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAACTT TTTTTACAGC CCTGTTTTTT CTGCTCGCCT GCTGTGCTGC TGCACAAAAT 
AAACTGTCCG GAAAAACAAA CTTCCCGGCC CATCCCAGGT TGCTCCTGCT TGCCGGAGAA
GAAGAAAACA TTAAAAAAAC CATTAAAAAT GACGTTCTGC TGCAAAACGT ACACCATGGC
ATACTGGCCA AATGCGATCA GTTTTTAAAG GATGCACCGG TAGAAAGGAT CAAGATCGGA
AGACGCTTAC TTGATAAATC AAGAACCTGC CTGCAAAGAG TTTTCTATCT TTCTTATGCA
TGGAGAACGA CCCATCAGCA AAAATACCTG CAGAGGGCCG AGAAGGAAAT GCTGGCCGCA
GCTGCTTTTG AAGACTGGAA CCCCGACCAT TTCCTGGATG TGGCAGAGAT GACCCTGGCA
TTGGCCATTG GGTACGACTG GCTTTACCAG GAACTGCCCG AAAGCTCCAG AATAGCTATA
AAAACAGCCA TCATCACCAA AGGGCTTCAG CCTTCGATGC TAAATGGCAG CAATATGTTC
TGGCTTAAAG CTGCACACAA CTGGAACCAG GTTTGCAATG CCGGACTTTC ATTTGGTGCA
ATGGCAGTTT ATGAAGAACA ACCTGAACTT GCCCAAACGA TCATTAACCG CGCCATCAAT
ACCATTGAAC TCCCTATGAA GATCTATGAA CCCGACGGGG CCTATCCCGA AGGTTATGGT
TACTGGAATT ATGGGACAAC TTTTAACGTA CTGCTGATCA GTGCCTTTCA AAAAGCCCTT
GCAACCGATT TCGGGCTTTC AGAACAACCA GGCTTTTTAA AGACCGCAGG CTTTCTTTTA
AACATGACCG GACCAACCGG CCAGCCTTTC AATTACTTTG ATTCCGGCAC TGGCGGAGAA
ATTAACCCGG CAATGTTCTG GTTTGCAGCA AAACTAAAGG ATCCTTCGCT CTTGTTTACT
GAAAGGACTT ACCTGGCTAA ACAAGTTAAA CATCTTACTG ACGACCGCCT CCTGCCTGCC
CTGCTGATCT GGAGCAGTGG TATCCAGCTG AATAAAATCC CGGCTCCGAA GAAAACGATG
TGGACAGGAA AAGGCAGTAA CCCAGTAGCG ATGATGCGCA GTTCATGGAC AGATCCCAAT
GCCCTGTATA TTGGCATGAA AGGTGGCTCC CCTTCTGTTA ACCACGGCCA CATGGATGTA
GGCTCTTTTG TAATGGAAGC AGATGGGGTA AGATGGGCCA GCGATTTTGG TATGCAGGGT
TATGAATCTT TGGAAGCTAA AGGGATCGAC CTATGGAATA TGAAACAAAA CTCCCAACGC
TGGCAGGTAC TGAGGTACAA TAACCTGTAC CACAACACAC TCAGCTTTAA CAACGAATTC
CAAGATGTGG ATGGTTACGC CCCTATTGTC AGTTATTCGG ACAATCCGGC TTATATGAAC
ACGGTGGTAG ACATCAGTTC TGTTTATAAA ACACAGCTTT CTAAAGCCAT CCGCGGAATT
GCCCTTGTTA ACAAGCAGTA TGTGGTGGTT CGGGATGAAC TGGAAGGCGG TGCCAAGGCA
ACAAAGGTAC GCTGGGCTAT GCTTACTGCT GCTGATGTAA AAATCATTGG CCCCAACGAG
GCGGAGCTGA GTAAAAATGG CAAAAAGCTA TATTTAAAAG TACAGGAACC TGTTCATATC
CAGCTTAAAA CATGGTCTAC CGCCCCAACA ACAGCTTATG ATGCACCCAA TCCGGGCACG
GTAATGCTTG GCTTTGAAAC CACTTTGCCT GAAAAATCAC CAGCCGTACT TACGGTATTG
TTAATCCCTC AGGCTCAGAA AAAAAATAGC ATACAAAAAA CGAAACCCAT AGCACAATGG
GAAAAAAAAC CATTTCGATA A
 
Protein sequence
MKTFFTALFF LLACCAAAQN KLSGKTNFPA HPRLLLLAGE EENIKKTIKN DVLLQNVHHG 
ILAKCDQFLK DAPVERIKIG RRLLDKSRTC LQRVFYLSYA WRTTHQQKYL QRAEKEMLAA
AAFEDWNPDH FLDVAEMTLA LAIGYDWLYQ ELPESSRIAI KTAIITKGLQ PSMLNGSNMF
WLKAAHNWNQ VCNAGLSFGA MAVYEEQPEL AQTIINRAIN TIELPMKIYE PDGAYPEGYG
YWNYGTTFNV LLISAFQKAL ATDFGLSEQP GFLKTAGFLL NMTGPTGQPF NYFDSGTGGE
INPAMFWFAA KLKDPSLLFT ERTYLAKQVK HLTDDRLLPA LLIWSSGIQL NKIPAPKKTM
WTGKGSNPVA MMRSSWTDPN ALYIGMKGGS PSVNHGHMDV GSFVMEADGV RWASDFGMQG
YESLEAKGID LWNMKQNSQR WQVLRYNNLY HNTLSFNNEF QDVDGYAPIV SYSDNPAYMN
TVVDISSVYK TQLSKAIRGI ALVNKQYVVV RDELEGGAKA TKVRWAMLTA ADVKIIGPNE
AELSKNGKKL YLKVQEPVHI QLKTWSTAPT TAYDAPNPGT VMLGFETTLP EKSPAVLTVL
LIPQAQKKNS IQKTKPIAQW EKKPFR