Gene Phep_0473 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0473 
Symbol 
ID8251560 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp562926 
End bp565085 
Gene Length2160 bp 
Protein Length719 aa 
Translation table11 
GC content37% 
IMG OID644934123 
ProductATP-binding region ATPase domain protein 
Protein accessionYP_003090759 
Protein GI255530387 
COG category[T] Signal transduction mechanisms 
COG ID[COG0642] Signal transduction histidine kinase 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones14 
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones31 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGGAAATCA AATACACTAA AGTATTAGCT CGAAACATCC TGCTTACTTT TCTGGTCTTC 
TCAATTATTT TTGCGGTAGC CGCACTTTTT TTATACAATA ACATTACTAA AAAACTGGCG
GGTATTTCTG AACTGGCCAG CCATATTGAT CGTGAACAGG AAAAACCCGA ACGGGCGCTG
ATCCTGCTCC AGCAAGCAGA AGATGATTTC CAGCAGTACC TTTTAGACAT CAATAGCAAG
AAGGATATAG ATTACAGGAA GAATTTGTCT CTGGCTTTTT ATGAAGTTGA TGCATTGCTT
CATGAAAATA AGAATATTGC CTACTTAACG GTGGCGCAAA GGGATAAAAT GAAACAATTA
AATCAAAAAA GGTTAAAACT ATCGAAAGAA TTTATCCTGC TCAAACATGA TTTTGATTCC
CTATTGTCGG TATATGCTAA ATTTAACGAT AAAACAAACT CAGATCTGAA TGGTAAAAAA
CCAATCCTTT CACCGGCGAA AATAAATCAG ACGGATACCA TTAAAAAGCT AATACCGCTG
GAAAAAAGAA GCTTTTTTAA AAGGATAAAA GATGCGATAC TAAATAGAGA TACTTCAAGA
GCAACCAGGG TTATAGAAAT TCGCCCTAAA GATTCCGGTA GCGCTGCCTT TATCGGCAAA
AATACGATAA ACATTTACGG GCAGGCTTAT GCAAAAAGAT TATTGCAATT GAACAAACAG
AATGCTAAGC TTTTAGATAT GCACAGAAAC CTCATTTCTC TTAGTATTCA TACGAGCAGT
GAACTCAGGC GCATCAATAA TGGTGTCAGA GAGATCAATG CTAATATGGC CGAAAACCTG
AAGAGAATGG CTTTTAAAAA CTACCGTGAA ACTACAGCGC TGCTTACCAA GGTTCATCTT
GCTGCGTTAT TACTTTTGCT ACTGTTCGGG ACCTTACTGA TCATATTTAT TATTCAGCTT
AACAGGTCGG AACAACATTT GAGAAAAGAA AATAAGCGTT CAGGGATTCT TGCCCAGCAA
AAAATGGATT TGTTAAACCG GATGAGCCAT GAAATACGTA ATCCTTTGAA TGCCATTAAA
GGATTTCTTT ACGTATTTAG CAGAACTAAT CTTTCTAAAG AGCAAACAAC GATGATCGAC
TCCATAAAAC TCTCGTCAGA CATGCTGCTT CAAACCCTTG ACGATACACT GGATGCGTCA
AAAATGGAAA ACAGTGAGTT TAAGATAAAC AAGGAGCCCT TTAATCCAGA CCTGACCCTT
CGAAAGGTTA TACAAAATAT GGAATATGGG GCAGCTGAAA AGAACCTGAA CATAGCTTAT
TTTTATTCCG GAAACTCTGA GGCCATAGTT AACGGAGACA GTTTCAGACT TAAACAAATC
CTGGTTAATC TACTCAGCAA TGCTATAAAA TATACTAAAG AAGGAGGGAT TATTGTGACC
GCAGAAATGA GCAATGACAA ACTATATGTT GCAGTTAAAG ATACAGGCGC CGGTATCAGT
TTAGAACAAC AATCCGGACT GTTCTCCAAA TATTACCAAA CAAATTCCTC AAAAGGACAA
ACAGGGACAG GATTGGGTTT ATTTATTTGT AAACAACTCG TGGAATTACA AAATGGAAAG
ATTCATGTAG AAAGTACACC AGGCATTGGC AGTACTTTTA GCTTTTATAT ACCATACAAA
AAAGCTGAAC ATAAGTTACC TATTAAGCAT GATCAGGAAA ATCCTTTGTT GTTTGCTGAT
GATTTGCTAT TCCTTAATGA ACGGAGTGTT TTGTTTGTTG ATGACAGTAA GCTCAATCTG
ATTTTTCTTA AAATGATGAC CAGCAAATGG AATTTAAAAT TTCGGGAAGC CTCTAATGGC
AAGCAGGCCT TGGCTATTAT TGCTAAGCAT AAAATAGATA TTGTCTTAAC TGATATTCAA
ATGCCTGAAA TGGACGGATA TGAATTGTTA TCGGCTATAA GATCATTGGC TGAACCTTTT
AACCGTTTGC CTGTAATCGC CGTTAGTGGA GAATCTGGTC TTGAAAGCGA AAAAAAACTC
ATTAAAAAAG GCTTTTCAGG CCTGATAAAT AAGCCGGTTG ATGAACAAAT TTTGAAACAG
CAACTACTTA AGGTAATAAA ATCAAACTTG AAAGATCAGC TTTCTAAGTT TAATGTATAG
 
Protein sequence
MEIKYTKVLA RNILLTFLVF SIIFAVAALF LYNNITKKLA GISELASHID REQEKPERAL 
ILLQQAEDDF QQYLLDINSK KDIDYRKNLS LAFYEVDALL HENKNIAYLT VAQRDKMKQL
NQKRLKLSKE FILLKHDFDS LLSVYAKFND KTNSDLNGKK PILSPAKINQ TDTIKKLIPL
EKRSFFKRIK DAILNRDTSR ATRVIEIRPK DSGSAAFIGK NTINIYGQAY AKRLLQLNKQ
NAKLLDMHRN LISLSIHTSS ELRRINNGVR EINANMAENL KRMAFKNYRE TTALLTKVHL
AALLLLLLFG TLLIIFIIQL NRSEQHLRKE NKRSGILAQQ KMDLLNRMSH EIRNPLNAIK
GFLYVFSRTN LSKEQTTMID SIKLSSDMLL QTLDDTLDAS KMENSEFKIN KEPFNPDLTL
RKVIQNMEYG AAEKNLNIAY FYSGNSEAIV NGDSFRLKQI LVNLLSNAIK YTKEGGIIVT
AEMSNDKLYV AVKDTGAGIS LEQQSGLFSK YYQTNSSKGQ TGTGLGLFIC KQLVELQNGK
IHVESTPGIG STFSFYIPYK KAEHKLPIKH DQENPLLFAD DLLFLNERSV LFVDDSKLNL
IFLKMMTSKW NLKFREASNG KQALAIIAKH KIDIVLTDIQ MPEMDGYELL SAIRSLAEPF
NRLPVIAVSG ESGLESEKKL IKKGFSGLIN KPVDEQILKQ QLLKVIKSNL KDQLSKFNV