Gene Phep_4177 details


Gene Information

Locus tag: Phep_4177
Symbol:
ID: 8255312
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 5050887
End bp: 5053214
Gene Length: 2328 bp
Protein Length: 775 aa
Translation table: 11
GC content: 46%
IMG OID: 644937842
Product: RNA binding S1 domain protein
Protein accession: YP_003094430
Protein GI: 255534058
COG category: [K] Transcription
COG ID: [COG2183] Transcriptional accessory protein
TIGRFAM ID: [TIGR00426] competence protein ComEA helix-hairpin-helix repeat region
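
The start, end, gene length and protein length fields in this record agree with one another, and a little arithmetic confirms it. The sketch below is illustrative only. It assumes 1-based inclusive coordinates (the record does not state its convention) and a coding sequence that ends in exactly one stop codon. The function names are hypothetical.

```python
def gene_length(start_bp: int, end_bp: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(gene_len_bp: int) -> int:
    """Residues encoded by an unspliced CDS, assuming one trailing stop codon."""
    if gene_len_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return gene_len_bp // 3 - 1


# Values taken from the Start bp and End bp fields of this record.
length = gene_length(5050887, 5053214)
print(length)                  # 2328
print(protein_length(length))  # 775
```

Both results match the Gene Length (2328 bp) and Protein Length (775 aa) fields.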


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 13
Fosmid unclonability p-value: 0.00696361
Fosmid Hitchhiker: Yes
Fosmid clonability: hitchhiker
 

Sequence

Gene sequence
ATGAGCAATC ATTCTAAAAT CATTGCTGCT GAACTGGCAG TAACGGAAAA GCAGGTCATC 
GCTACTATTG AATTACTGGA TGAGGGGGCA ACGGTTCCCT TTATTTCACG TTACCGGAAA
GAAGTGACAG GAAGTCTGGA TGAAGTGCAG GTAGCAGGTA TCCGCGATAG GTTCCAACAG
TTAAGAGAGC TGGATAAACG TCGGGAAGCC ATTTTAAAAG CCCTAACAGC GCTGGATAAA
CTTACACCTG AGCTGGAAGC ACAAATTAAT GCGGCAACCA ATATTGCAGC AATTGAAGAT
ATTTATCTGC CTTATAAGCC CAAACGCAAA ACCAGAGCAT CTGAGGCCCG GAAAAAGGGA
TTGGAACCCT TGGCGTTATT TATCCTGGAG CAGGCTAAAA CGGATCCGGA AATTGAAGCA
GGTAAATTTT TAAATGATGA GCTTGGGGTC GGATCTTTAG AAGATGCGCT TGCTGGTGCA
AGGGATATCA TTGCCGAATT GGTCAATGAA AATGCAGAGG CCAGAACTGC AATGCGGAAT
TATTTCCAGC AAAAAGCTGT ATTTAAGTCT GCCGTAATTA AGGGCAAAGA AGAAGAAGGG
ATTAAGTATA AAGATTATTT TGACTGGCAG GAACCGTTAA AAAATGCGCC ATCACACCGG
GTTCTGGCTA TACGCAGAGG CGAAAATGAG GCGATCCTGA AACTGGAGGC CATGCCCGAG
GAAGATGGTG CCATCAGGAT TTTGGAAAGC CAGTTTGTTA AAGGGAATAA TGCCTGTGCA
GATCAGGTAA AACTGGCTAT ACAGGATAGT TATAAGCGTT TGCTGGGGCC TTCGATGGAA
ACGGAGATCC GTAATCTTTC TAAGGAAAAA GCAGATGAAG AAGCCATTCG TGTATTTGCA
GAAAATGCAC GCCAATTGTT GCTCGCCGCC CCAATGGGAC AGAAAAATGT GCTGGCTATA
GACCCTGGTT TCCGTACAGG TTGTAAAGTG GTATGTTTGG ATAAGCAGGG CAAATTATTG
GAAAATGCAA CCATATATCC ACATACCGGA CAGGGGAATG TTAAAAAAGC GGGCGACAAG
ATTAAGGAGC TTTGTGCCAA ATATGAAATT GAGGCTGTTG CTATCGGCAA TGGTACCGCG
GGCCGCGAAA CAGAAGTGTT TGTACGTGCG CTGAATATTC CGGGCATCGT AGTTGTGATG
GTGAGTGAAA ACGGGGCCTC CATTTATTCC GCTTCGGAAG TGGCGCGTGA AGAATTCCCT
ACACAGGACA TTACGGTCAG AGGTGCTGTT TCCATTGGCC GCAGGTTAAT GGACCCGCTG
GCCGAGCTGG TAAAAATAGA CCCTAAATCG ATAGGTGTAG GGCAGTACCA GCATGATGTA
GACCAGAACA AACTGCAGGC CTCACTGGAT GATACGGTGA TGAGTTGTGT AAATGCTGTA
GGTGTGGAAT TGAATACGGC CTCTAAACAG GTGCTTGCCT ATGTTTCAGG CTTGGGTCCG
CAGCTGGCAC AAAATATTGT GGCTTACCGC AATGAGCACG GTGCATTCAA AAACCGTGAA
AGTTTGAAGA AAGTTCCCCG TTTGGGTGAT AAAGCTTTTG AACAGGCGGC AGGATTTTTA
AGGATCAGGA ATGCAGAAGC CGTATTGGAT TCCAGCGGTG TACACCCGGA ACGTTATGCA
TTGGTCCATA AAATGGCCAG AGACCTGAAC TGCAGCATTA ATCAGCTGGT AAAAGATCCG
TCTTTACAGC AGCAGATTAA GCTACAGCAA TATGTAAGTG ACGATATTGG CTTACCAACC
CTGAACGACA TCATGAGCGA ATTGGCTAAG CCAGGGAGAG ATCCGCGTGA ACAGTTCGAG
GTGTTTAGTT TTACGGAAGG GGTAAATGAA ATTTCCGACC TGAAAGTGGG TATGAAATTG
CCGGGTATTG TTACCAATAT TACCAATTTT GGTGCTTTTG TAGACATCGG TGTACACCAG
GACGGCCTGG TGCATACCAG CCAGATCTCG GACAAGTTTG TGGCCAATGC CGCAGATGTG
GTTAAGGTAC ATCAGAAGGT AGAGGTGACC GTGGTTGAAG TTGATGTGGC CCGTAAAAGG
ATTTCTCTGT CTATGAAAAA AGAAGTTGGC GCAAAACCTA AAAGCGAACA GCATAAGCAG
GCTGATAAGG CTAAAAAGTT TGTGAAGACT AATGGATCTG CCCAACATAA AAAGCCAATA
AAGCCGCAAC ACAGTCATAA AGGGGGGCAA AAAGAAAAGC CGCTGCCCGA TGGTGACCTG
CAGATGAAAT TAGCGGCGTT GAAAAATATG TTTAAAACTG AAAAATAA
 
Protein sequence
MSNHSKIIAA ELAVTEKQVI ATIELLDEGA TVPFISRYRK EVTGSLDEVQ VAGIRDRFQQ 
LRELDKRREA ILKALTALDK LTPELEAQIN AATNIAAIED IYLPYKPKRK TRASEARKKG
LEPLALFILE QAKTDPEIEA GKFLNDELGV GSLEDALAGA RDIIAELVNE NAEARTAMRN
YFQQKAVFKS AVIKGKEEEG IKYKDYFDWQ EPLKNAPSHR VLAIRRGENE AILKLEAMPE
EDGAIRILES QFVKGNNACA DQVKLAIQDS YKRLLGPSME TEIRNLSKEK ADEEAIRVFA
ENARQLLLAA PMGQKNVLAI DPGFRTGCKV VCLDKQGKLL ENATIYPHTG QGNVKKAGDK
IKELCAKYEI EAVAIGNGTA GRETEVFVRA LNIPGIVVVM VSENGASIYS ASEVAREEFP
TQDITVRGAV SIGRRLMDPL AELVKIDPKS IGVGQYQHDV DQNKLQASLD DTVMSCVNAV
GVELNTASKQ VLAYVSGLGP QLAQNIVAYR NEHGAFKNRE SLKKVPRLGD KAFEQAAGFL
RIRNAEAVLD SSGVHPERYA LVHKMARDLN CSINQLVKDP SLQQQIKLQQ YVSDDIGLPT
LNDIMSELAK PGRDPREQFE VFSFTEGVNE ISDLKVGMKL PGIVTNITNF GAFVDIGVHQ
DGLVHTSQIS DKFVANAADV VKVHQKVEVT VVEVDVARKR ISLSMKKEVG AKPKSEQHKQ
ADKAKKFVKT NGSAQHKKPI KPQHSHKGGQ KEKPLPDGDL QMKLAALKNM FKTEK
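
Both sequences are printed in blocks of ten characters, so the spaces and line breaks have to be stripped before any analysis. The sketch below is a minimal illustration that uses only the first row of the gene sequence. The helper names `join_blocks` and `gc_fraction` are hypothetical and do not come from the source database.

```python
def join_blocks(text: str) -> str:
    """Remove the spaces and line breaks used to group a sequence into blocks."""
    return "".join(text.split()).upper()


def gc_fraction(dna: str) -> float:
    """Fraction of bases that are G or C."""
    if not dna:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in dna) / len(dna)


# First row of the gene sequence, copied from this record.
first_dna_row = "ATGAGCAATC ATTCTAAAAT CATTGCTGCT GAACTGGCAG TAACGGAAAA GCAGGTCATC"
dna = join_blocks(first_dna_row)
print(len(dna))               # 60
print(dna.startswith("ATG"))  # True
```

Running `join_blocks` over the full gene sequence and passing the result to `gc_fraction` gives a value that can be compared with the GC content field of the record.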