Gene Phep_2003 details


Gene Information

Locus tag: Phep_2003
Symbol:
ID: 8253107
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 2312458
End bp: 2314446
Gene Length: 1989 bp
Protein Length: 662 aa
Translation table: 11
GC content: 31%
IMG OID: 644935652
Product: putative ATP-dependent helicase
Protein accession: YP_003092271
Protein GI: 255531899
COG category: [R] General function prediction only
COG ID: [COG3972] Superfamily I DNA and RNA helicases
TIGRFAM ID:
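
The coordinates and lengths in this table are mutually consistent, and the arithmetic can be checked with a short sketch. It assumes the start and end positions are 1-based and inclusive, and that the final codon of the CDS is a stop codon that is not counted in the protein length. The function names are illustrative and not part of any database API.

```python
# Values copied from the Gene Information table.
START_BP = 2312458
END_BP = 2314446
GENE_LENGTH_BP = 1989
PROTEIN_LENGTH_AA = 662


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by an unspliced CDS whose last codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(START_BP, END_BP))            # 1989
print(expected_protein_length(GENE_LENGTH_BP))  # 662
```

Both results agree with the Gene Length and Protein Length entries: 1989 bp is 663 codons, one of which is the stop.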


Plasmid Coverage information

Num covering plasmid clones:
Plasmid unclonability p-value: 0.148023
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 12
Fosmid unclonability p-value: 0.00375343
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGGCCGCTC AATTGTTTAA AAATCTCAGT AAAAATGAAC TAGGTGAAAA TATCGCCAAT 
AAGTTTGAAG AGTATTTAAC AAACCATTCA AATAAACAGA TCTACTTAAT TACATCCCCC
TTAGGAGAAA AGTATAGTTA TGATTATGAA GATAACTGTT TGGTCATATT AATCCCAAAA
CACAGAATAA TCTTTTTGAA TCTTGGAAAT AACGATGAAA AATTTCAAGA CTATTGTGAT
GACTTTATTG AAGATTTAAA TTCGATTTCT GACAAATACA ACTACAAAGA ACATATAGGC
CGACCAAGGG ATTGGAAAAA GAACAATACT GTTCGAATTG AAAATTTTGA CCTCAATAAT
ATAGAAAGCG TTGTCGCCGA AAACTTTCTA GAAGATCTTA AATCACAAAG AATAAGCGAC
TTACTTATTT CATTATTAAT CGGAAGTATT AACGACATTG AAAACATTGG ATCTGAAGTC
CCAGAAACGC TGCTCGAAAA AGTTAAGAAA AACATTGTTT TATTTGATGG TGAGCAAACC
AGATTTATGT ATCAGGATTT CGACAATAAA ACAATCACTA TTCAAGGGTT GTCAGGTACA
GGAAAGACCG AACTTTTGTT ACATAAACTA AAAGACTTGT ATGTTAAAAC AAGTACCTCA
AAGATATTTT TCACCTGTCA TAATATAGCC TTAGCGAATA CATTACAAGA AAGAGTCCCT
ATTTTTTTCA ACTTTATGAG AGTAGAGAAA CAAATAGAAT GGAATAAACG ATTATGGGTT
AATAGAGCAT GGGGATCTAA AGGAGACCCC AATTCAGGCA TATATAGTTA TTTATGCTAC
TTTTACGACA TACCCTTCTT AAGATTTAGT CCAACAAATG ATTATAATAG AATATTCACT
TTAGCATTAG ATTACATTGA GAGTATTGAC CCTAAAGATT TCGAATATGC TTTTGATTAC
CTATTGATTG ATGAACGTCA AGATTTTCCA GATGTCTTTT TTCAGGTATG TGAAAAAGTA
ACCAAAGAAA ATGTTTATAT AGCGGGAGAC ATATTCCAAG ACATTTTCGA GAACATTGAT
AAAAAAGTCT TACAAGTAGA TGTCGTACTT AATAAATGCT ATCGAACCGA TCCTAGAACG
CTAATGTTTG CTCACTCAGT AGGACTAGGC TTGTTTGAAG AAAAGAAATT AAATTGGTTT
GACGACGACG AATGGAATGC ATTCGGATAT AATGTAAAAA GACTTGCTGA CAAGGAAATA
CAGTTTACTA GAGAACCACT AAGAAGATTT GAAGATATTG ACACAGACAA ATTTGAAAGT
GTTGAAATTA TAAAATCTAC TAAAAGCTCA AAGGTTATCA AAGCTATTGA AAGCATAATC
CGAGAAGATG AAACAGTAAG CCCTAATGAT ATTGCAATAA TTATTTTGGA TGATGATAAA
CAAATCTATG AATACATTGA TCATCTCTCT TTAATTATAA ACAAAAGATT CGGCTGGAGA
ATAAACAGAG CACACGAAAG TAAGTCATTA ATTGATAACA CTTTATGTAT CTCAAACTCA
AACAATGTTA AAGGTTTAGA ATTTCCATTT GTAATATGTA TCACTGGAAC AATTAAGCAA
ACGTACAGAT ATAGAAATAT ACTTTACACA ATGCTAACCA GATCTTTTAT TAAGTCCTTT
TTGCTTGTAA ACGAAAAAAA TGATATCAAA CATCTAGAGA AAGGATTAAA AATAATAAAT
CAAACTAAAG CAATAAAGAC AACAGAACCT ACCCCTAAAG AACAAAAAGA AATCAAAAAC
AACTTAGTTG GCTTCTTAAG CAGTTCACAG AAATCTTATA AGGAATTTCT AACTGAAATA
TTTGACAAGT TAGCCATTGA GGAAACAAAA CGTAAAAATA TCGAAGACGT TTTAGTTAAT
GCTAATATTG ATAAATTTGA TGAAGAACGT ACCTCTGCAT TTATAGTTTC TTTAAAGGAG
TATTACTAG
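
The gene sequence is printed in blocks of ten bases, so the spaces and line breaks have to be removed before any downstream use. The following is a minimal sketch of that clean-up together with a GC-percentage calculation. The helper names are hypothetical, and the demo input is only the first line of the sequence, so its GC value is not expected to equal the whole-gene figure of 31% reported in the table.

```python
def join_blocks(text: str) -> str:
    """Remove spaces and line breaks from a block-formatted sequence."""
    return "".join(text.split()).upper()


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a nucleotide sequence."""
    if not seq:
        raise ValueError("empty sequence")
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First line of the gene sequence, used only as a short demo input.
first_line = "ATGGCCGCTC AATTGTTTAA AAATCTCAGT AAAAATGAAC TAGGTGAAAA TATCGCCAAT"
seq = join_blocks(first_line)

print(len(seq))                    # 60
print(round(gc_percent(seq), 1))   # GC percentage of this 60-base fragment only
```

Running the same two functions over all sequence lines joined together would give the whole-gene length and GC percentage for comparison with the table.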
 
Protein sequence
MAAQLFKNLS KNELGENIAN KFEEYLTNHS NKQIYLITSP LGEKYSYDYE DNCLVILIPK 
HRIIFLNLGN NDEKFQDYCD DFIEDLNSIS DKYNYKEHIG RPRDWKKNNT VRIENFDLNN
IESVVAENFL EDLKSQRISD LLISLLIGSI NDIENIGSEV PETLLEKVKK NIVLFDGEQT
RFMYQDFDNK TITIQGLSGT GKTELLLHKL KDLYVKTSTS KIFFTCHNIA LANTLQERVP
IFFNFMRVEK QIEWNKRLWV NRAWGSKGDP NSGIYSYLCY FYDIPFLRFS PTNDYNRIFT
LALDYIESID PKDFEYAFDY LLIDERQDFP DVFFQVCEKV TKENVYIAGD IFQDIFENID
KKVLQVDVVL NKCYRTDPRT LMFAHSVGLG LFEEKKLNWF DDDEWNAFGY NVKRLADKEI
QFTREPLRRF EDIDTDKFES VEIIKSTKSS KVIKAIESII REDETVSPND IAIIILDDDK
QIYEYIDHLS LIINKRFGWR INRAHESKSL IDNTLCISNS NNVKGLEFPF VICITGTIKQ
TYRYRNILYT MLTRSFIKSF LLVNEKNDIK HLEKGLKIIN QTKAIKTTEP TPKEQKEIKN
NLVGFLSSSQ KSYKEFLTEI FDKLAIEETK RKNIEDVLVN ANIDKFDEER TSAFIVSLKE
YY
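
The link between the gene sequence and the protein sequence can be illustrated by translating the first and last codons. This is a sketch under one stated assumption: translation table 11 assigns the same amino acids to codons as the standard genetic code and differs only in the permitted start codons, so a standard codon table is sufficient here. The `translate` helper is illustrative and stops at the first stop codon.

```python
from itertools import product

BASES = "TCAG"
# Amino acids in NCBI order: first base varies slowest, third base fastest,
# each running over T, C, A, G. "*" marks a stop codon.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def translate(seq: str) -> str:
    """Translate a nucleotide sequence, stopping at the first stop codon."""
    seq = "".join(seq.split()).upper()
    protein = []
    for i in range(0, len(seq) - len(seq) % 3, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First three blocks and the final nine bases of the gene sequence.
print(translate("ATGGCCGCTC AATTGTTTAA AAATCTCAGT"))  # MAAQLFKNLS
print(translate("TATTACTAG"))                         # YY
```

The two outputs match the first ten residues (MAAQLFKNLS) and the last two residues (YY) of the protein sequence, with TAG acting as the stop codon.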