Gene Phep_0022 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_0022 
Symbol 
ID8251106 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp24424 
End bp27765 
Gene Length3342 bp 
Protein Length1113 aa 
Translation table11 
GC content45% 
IMG OID644933671 
Producttranscription-repair coupling factor 
Protein accessionYP_003090310 
Protein GI255529938 
COG category[K] Transcription
[L] Replication, recombination and repair 
COG ID[COG1197] Transcription-repair coupling factor (superfamily II helicase) 
TIGRFAM ID[TIGR00580] transcription-repair coupling factor (mfd) 


Plasmid Coverage information

Num covering plasmid clones
Plasmid unclonability p-value
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones25 
Fosmid unclonability p-value
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
GTGAATATTC GCGACCTGAT CAGCAGATAC AAGACAGATG AGCGTATTGT AGCATTGGCA 
AAGGCACTAA ACGCCGGTAA AGGCACTAAA CTTCAACTAA AGGGTCTAGT AGGTTCTGCC
GATGCAACAA TTGCTGTGGC CATCTATTTC CTGTTACATA AACCCCAGCT ATTTGTACTT
CCCGATAGGG AAGAGGCGGC CTATTTTCTG GCCGACCTGG AAAGTATCCT CGATAAAGAG
GTACTGCTCT TTCCTTCTTC CTTCCGTAAA GCCTTTGATT TTACACAAGT GGATACCGCC
AATGTGCTGG CGCGTGCCGA AGTACTGAAC GAACTGAACC ACCAGTCAGA ATACGGCAAA
ATTGTGGTTT CCTATCCTGA GGCCCTGGCC GAAAAGGTGA TAGACCGCAC CGTGCTGGAA
AAAAACACGC TGGAGATCAG TTTAAATGCC AAACTGGGCA TCGATTTTAT CAATGAGTTT
CTAATCGACT ACGATTTTAA CCGTACAGAT TTTGTATATG AACCCGGACA GTTTTCCATC
CGTGGCGGTA TCGTCGATAT CTTTTCCTTC TCGCACGACC TGCCTTACCG CATCGAGTTT
TTTGGCGATA TCGTGGAAAG TATCCGTACT TTTGAAATTG AAAGCCAGCT TTCCGTTGAA
GATGTAAAAA CACTGACCAT CATTCCCAAT GTACAATCAA AATACCTTAC CGAAACCAAC
ATCAGTATCC TCGACTACAT TGAAAAGGAT ACCCAGGTTT GGTTTAAGGA CGTAGAATTC
ACTTTGGACA TTGTTAAAAG TGGATATAAA AAAGCGGTTG AACTATGGAA AGCCCTGCCC
GCAAAAGAAA AACAGGAAAA CCAGGACTGG ATAGATCCTA AATTTGCTTT TACAGACGAG
AAAATGATGG GCGATATGTT CCACGATTTT CCGCTTGTAG AATTTGGAAA GCAGTTCTTT
TATAAAACAG ATCAGGTTTT TCAGTTCGAA ATCAAACCCC AACCCTCTTT TAACAAAGAT
TTCAGCCTGC TGATCCACAA CCTTAAAGAA AACGAAAAAC AGGGCCTGCA TAACTTTATT
TTCAGCGCTT CAGTTAAACA AACGGAACGT TTATATGCCA TCCTGGACGA TATCGACAAA
TCTGTCAAAT TTACACCGGT AAATATCCCG CTCAGGGAGG GTTTTGTAGA TGGGGGGCAG
AAAGTGGCCT TTTATACCGA TCACCAGATT TTCGACCGTT TTTATAAATA TAAGCTCAAA
AGAGGCTATC AGCGCAGTCA GGCTATCACC TTAAAAGAAC TGAGGGACCT GAAGCCCGGC
GATTTTGTAA CCCATATAGA CCATGGGATA GGTAAATATG CCGGTTTGGA AAAGGTAGAG
GTAAACGGTA AAACCCAGGA AATGATCAGG CTGGTCTATG CCGATAACGA CCTGCTGTAT
GTCAACATCA ACTCCCTTAA CCGCATTGCC AAATACAGTG GAAAAGATGG TACTGCTCCA
AAAATGAACA AACTGGGCAC AGAGGCCTGG GATAAGCTCA AAAAAACAAC TAAAAAAAAA
GTTAAGGACA TTGCCCGAGA CCTGATCAAG CTGTATGCCC TGCGAAAATC GCAGGTAGGA
ACAGCTTTCG CGCCCGATGG CTATCTTGAA ACCGAGCTGG AAGCCTCCTT TATTTATGAA
GATACGCCCG ATCAGGAAAA GGCAACCAGC GATGTAAAAA AGGACATGGA AGCGCCACAT
CCCATGGACC GGCTGGTTTG TGGTGATGTG GGCTTCGGTA AAACGGAAAT TGCCATCAGA
GCTGCTTTTA AGGCGGTTGC CAATGGCAAA CAGGCGGCCG TACTGGTGCC AACCACCATT
CTTGCCCTCC AGCACTTTAA AACTTTTACC GGTCGATTAA AGGACTTTCC CTGCACCGTA
GATTACATCA ACCGCTTTAA AACAAGCAAG CAGATCAAAG AAACGCTGGC TAAAGTGGCC
GAAGGCAAAG TAGATATCAT CATTGGCACG CACCGCCTGC TCAGTAAGGA TGTGAAGTTT
AAAGATCTGG GCATCATGAT CATCGATGAG GAGCAGAAAT TCGGGGTATC TTCCAAAGAG
AAATTAAGGG CTTTAAGGGT AAATGTAGAT ACCTTAACCC TTACCGCAAC CCCCATACCC
AGAACACTGC ACTTCTCTTT AATGGGGGCC CGCGACCTTT CCATCATGAG TACCCCGCCA
CCAAACCGGC AGGCTGTAAA TACAGAATTA CATGTTTTTA ACGATAAACT GATCCAGGAA
GCCGTTCAGT TTGAACTGGA CAGGGGCGGA CAGGTGTTTT TTATCCATAA CAGGGTGCAC
GACCTCCCTC AACTGGGGGG ATTGATCCAG ACGCTGGTCC CAAAAGCCCG TATTGGTATT
GCACACGGAC AACTGGATGG CGATCAGCTG GAAGATGTGA TGCTCGATTT TATCAATGGA
GAAAAGGATG TCCTGGTAGC CACCACCATT ATCGAGGCCG GTCTTGACAT CCCAAATGCC
AATACCATCA TCATCAACCA TGCGCATATG TTTGGCCTGA GCGACCTGCA CCAAATGCGA
GGCAGGGTAG GCCGGTCCAA TAAAAAAGCC TTCTGTTACC TGCTTAGCCC GCCTTTAAGT
ACGCTTACTT CGGAAGCCCG CAAACGCCTG AGTGCAATTG AAGAGTTTTC AGACCTTGGA
AGCGGTTTTA ACATTGCCAT GAGGGATCTG GACATCCGGG GTAGCGGAAA CCTCTTAGGC
GCCGAGCAAA GTGGCTTCAT TGCCGAAATA GGTTTCGAGA TGTACCACAA AATACTGGAT
GAAGCCATAC AGGAACTGAA AACAGATGAA TTTAAAGACC TGTTTAAGGA TGAGCCCTTA
CGCCCCTTTG TGAATTTTAC ACAAGTGGAT ACCGACCTGG AGCTGTATAT TCCGGATGAT
TATGTAACCA ACATTACCGA AAGGTATAAC CTCTATACCG AACTTTCAAA AATTGAGGAT
GAAAACCAGC TAAAGGCGTT TGAACTGAGT TTAAAAGACC GTTTTGGTCC TGTTCCTCAT
CCGGTAAAAA CCATGTTGAA TGTATTGCGG CTGCAATGGA TAGCCAAAAA ACTGGCTTTC
GAAAAAATAA GTTTCAAAAA AGGTGTATTG CGCGGTTATT TCATTACCGA CAAACAATCT
GCCTTTTTTG ATTCTATTAT GTTCAATAAA ATATTGCATT TTGCACAGAT CCACCCGCGT
TTATGCAATC TTAAAGAAGT TAAAGATTCA CTCCGCATTT CATTCGACAA CCTCAATACC
ATAGATGAAG CTGTAGAAAT GCTGGAAATG GTGGTCAATT AA
 
Protein sequence
MNIRDLISRY KTDERIVALA KALNAGKGTK LQLKGLVGSA DATIAVAIYF LLHKPQLFVL 
PDREEAAYFL ADLESILDKE VLLFPSSFRK AFDFTQVDTA NVLARAEVLN ELNHQSEYGK
IVVSYPEALA EKVIDRTVLE KNTLEISLNA KLGIDFINEF LIDYDFNRTD FVYEPGQFSI
RGGIVDIFSF SHDLPYRIEF FGDIVESIRT FEIESQLSVE DVKTLTIIPN VQSKYLTETN
ISILDYIEKD TQVWFKDVEF TLDIVKSGYK KAVELWKALP AKEKQENQDW IDPKFAFTDE
KMMGDMFHDF PLVEFGKQFF YKTDQVFQFE IKPQPSFNKD FSLLIHNLKE NEKQGLHNFI
FSASVKQTER LYAILDDIDK SVKFTPVNIP LREGFVDGGQ KVAFYTDHQI FDRFYKYKLK
RGYQRSQAIT LKELRDLKPG DFVTHIDHGI GKYAGLEKVE VNGKTQEMIR LVYADNDLLY
VNINSLNRIA KYSGKDGTAP KMNKLGTEAW DKLKKTTKKK VKDIARDLIK LYALRKSQVG
TAFAPDGYLE TELEASFIYE DTPDQEKATS DVKKDMEAPH PMDRLVCGDV GFGKTEIAIR
AAFKAVANGK QAAVLVPTTI LALQHFKTFT GRLKDFPCTV DYINRFKTSK QIKETLAKVA
EGKVDIIIGT HRLLSKDVKF KDLGIMIIDE EQKFGVSSKE KLRALRVNVD TLTLTATPIP
RTLHFSLMGA RDLSIMSTPP PNRQAVNTEL HVFNDKLIQE AVQFELDRGG QVFFIHNRVH
DLPQLGGLIQ TLVPKARIGI AHGQLDGDQL EDVMLDFING EKDVLVATTI IEAGLDIPNA
NTIIINHAHM FGLSDLHQMR GRVGRSNKKA FCYLLSPPLS TLTSEARKRL SAIEEFSDLG
SGFNIAMRDL DIRGSGNLLG AEQSGFIAEI GFEMYHKILD EAIQELKTDE FKDLFKDEPL
RPFVNFTQVD TDLELYIPDD YVTNITERYN LYTELSKIED ENQLKAFELS LKDRFGPVPH
PVKTMLNVLR LQWIAKKLAF EKISFKKGVL RGYFITDKQS AFFDSIMFNK ILHFAQIHPR
LCNLKEVKDS LRISFDNLNT IDEAVEMLEM VVN