Gene Phep_0202 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0202 
Symbol 
ID8251287 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp233558 
End bp235402 
Gene Length1845 bp 
Protein Length614 aa 
Translation table11 
GC content44% 
IMG OID644933852 
ProductATP-binding region ATPase domain protein 
Protein accessionYP_003090490 
Protein GI255530118 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones21 
Fosmid unclonability p-value0.331744 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAAAAAA GGAGCCTTTG GCTAATTACC GGACTCATGA CCGTAGCATT GCTCGGTGTG 
TTTGTAATGC AATTGTATTA CATCAGGCAG GCCTATAACC TAAATTCCCA GCTATTTGAA
CAGGATGTGA ACCAGGCATT AAATGCGGTG GTTACCAAAG TACAAAAACG CAATGCAGCC
TTGCACATCA GCAGGAAGGA CAGGCAAATG GAGCTGGAAA GGAAGGCCGA GGTGCAGGAC
CAGGCCAAAC AGCTGGTAGA ATATAAGGAC CGCTTTAAAG AAGAAGCGCA AAAGCGCGTA
CTGGAACGGC AGCGGCTCAT TGTAGCCGAC CTGAACCAGA CCGATCTTGA GATCAGAAAG
TCATACGCCA ATCCCATCCT GATCTCTGAA GATGAATTTC GTGCTGCCTC CGATCTGGGT
AATCCCAATA AATCGCCGGT GCATGTAGAC GTAAATGTAG GGCTTGATGC TTTTGGAAAT
GTAGTTGCCG GACAGATCAA ATCGACCTTT GTACAGGAAA GACCGAAGCT TTTTAATGTA
GCGCAGACTA AACTGCCAGA TAGTATCCAT TACCTGGCAG TTGATCCCGG TACCGGGAAA
AATGTGCTGA TCAGTGTAAG AACAGTGAGT GCTGAACTGG AAAGGAAATT TATCCTGGAA
GATAACCTGG CCAAAAGGAA GTACGATGAA GGCTTGAAAA GGTTGATGTC CGATACCATT
CCAATGAAGC CGGGGGCCGA TATCCTGTTG CAGGACGTGG CCAAGGAGAT GCGCGATGCC
AATGTGCCGC TGAGTAAAAT GATTGCTAAA GATGTGCTGG ATACCCTCAT CAAGAAGGAG
CTGCTGAACA GGAACATTAC ACTCAAATAT GATTTCTGGG TAGGCCTGGC ACAAAAGGAT
TCCCTGGTGT ATCGGAAAGC GGCCAACACA ACCGGAGAAT TATTACCTAA AAATACCTAT
AAAGCAACAC TTTTTAATTC TGTGATTCGC GATCCGGGTA TGTTATACCT GTATTTTCCA
AATAAAAACT CACTCATATT TGCTAATCTA AGTGTAACGA TGGCCTCTTC TGCGGGATTG
TTGTTGGTGC TGATCTTCAT TTTTGCCTAT ACCATTTACA CCATTCTGAG GCAGAAAAAG
ATTTCTGAAA TGAAGACGGA CTTTATCAAC AACATGACCC ATGAATTTAA AACACCCGTA
GCCACCATTA TGATTGCCAG TGAGGCCCTT AAGGATCCGG AGATTGTAGC GGATAAAAGC
CGCATCAGCA GGCTCGCCGG AATTATCTAT GATGAGAATG TACGTTTGGG TAACCACATA
GAAAGGGTGC TGAGCATTGC ACGACTGGAA AAGAAAGAGT TAAAACTGGA GCATAAGGAA
GTGAATATCC ACGACCTGAT CACGGCGGTG GTAGACAGCA TGAACCTGCA GCTGCAAAAG
AAAAATGCCC GGGTAACCTT AAACCTGGAA GCCCTTCAGC CAGTTATTTT GGGTGATGAG
CTGCACCTGT CCAATGTATT TTATAATCTG GTAGACAATG CCAATAAATA CAGTTCGGAA
AATCCTAAAA TTACCATCAC CACCAGGAAT ACTGACAAAG CATTGCACAT TGAAATTGCC
GATGAAGGTA TTGGCATGAC CAAAGAACAC AGCAAACGTA TTTTTGACCA GTTTTATAGG
GTGCCAACCG GTAACCTGCA CGATGTAAAA GGTTTTGGCC TTGGGCTTAA CTATGTGCAG
GACATCATCG AGCAAATGAA CGGCTCTATA AAAGTACAGA GTGAAAAAGA TAAAGGAACT
ACCTTTGAAA TCAATTTACC GCTAAACCAT AACCACAGCA AATGA
 
Protein sequence
MKKRSLWLIT GLMTVALLGV FVMQLYYIRQ AYNLNSQLFE QDVNQALNAV VTKVQKRNAA 
LHISRKDRQM ELERKAEVQD QAKQLVEYKD RFKEEAQKRV LERQRLIVAD LNQTDLEIRK
SYANPILISE DEFRAASDLG NPNKSPVHVD VNVGLDAFGN VVAGQIKSTF VQERPKLFNV
AQTKLPDSIH YLAVDPGTGK NVLISVRTVS AELERKFILE DNLAKRKYDE GLKRLMSDTI
PMKPGADILL QDVAKEMRDA NVPLSKMIAK DVLDTLIKKE LLNRNITLKY DFWVGLAQKD
SLVYRKAANT TGELLPKNTY KATLFNSVIR DPGMLYLYFP NKNSLIFANL SVTMASSAGL
LLVLIFIFAY TIYTILRQKK ISEMKTDFIN NMTHEFKTPV ATIMIASEAL KDPEIVADKS
RISRLAGIIY DENVRLGNHI ERVLSIARLE KKELKLEHKE VNIHDLITAV VDSMNLQLQK
KNARVTLNLE ALQPVILGDE LHLSNVFYNL VDNANKYSSE NPKITITTRN TDKALHIEIA
DEGIGMTKEH SKRIFDQFYR VPTGNLHDVK GFGLGLNYVQ DIIEQMNGSI KVQSEKDKGT
TFEINLPLNH NHSK