Gene Phep_1778 details

Gene Information       Plasmid Coverage information       Fosmid Coverage information       Sequence       

Gene Information

Locus tagPhep_1778 
Symbol 
ID8252881 
TypeCDS 
Is gene splicedNo 
Is pseudo geneNo 
Organism namePedobacter heparinus DSM 2366 
KingdomBacteria 
Replicon accessionNC_013061 
Strand
Start bp2075531 
End bp2076508 
Gene Length978 bp 
Protein Length325 aa 
Translation table11 
GC content40% 
IMG OID644935429 
Producthelix-turn-helix- domain containing protein AraC type 
Protein accessionYP_003092049 
Protein GI255531677 
COG category[K] Transcription 
COG ID[COG4977] Transcriptional regulator containing an amidase domain and an AraC-type DNA-binding HTH domain 
TIGRFAM ID 


Plasmid Coverage information

Num covering plasmid clones10 
Plasmid unclonability p-value0.545321 
Plasmid hitchhikingNo 
Plasmid clonabilitynormal 
 

Fosmid Coverage information

Num covering fosmid clones16 
Fosmid unclonability p-value0.0235367 
Fosmid HitchhikerNo 
Fosmid clonabilitynormal 
 

Sequence

Gene sequence
ATGAAACATA TATCCATACT TATCCCAACA GGGGGCGTAT TGGGCGGTAT TGAAATCGCC 
CGCAATATGA TTGCAGAGGC CAATAATTAT GTCATCAGTA AGGGGAACGA GGCCGTGTTT
CAAATGCAGC TTGTAGGAGC AAAGAAAAAT ATTGAACTGA ACGAGCGTCA AGTCTTAATT
CGACCGGATG TTACAATAAA TTATTTAAAA CAAACCGATC TTATAATTAT TCCAGCGATC
AATGTCGATT TGGAACTCAA CAATAATCCA AATGTGCCGC TTATCAACTG GGTAAGGCAG
ATGCGTAAAC AAGGTGCTGA GGTGGCATGT CTCTGTGTTG GTGCCTTTTT GCTAGCTAAG
ACCGGGATTT TAGACGGTAA AACCTGTACA ACGCACTGGA AAGCTTCAGA GCTTTTCATC
AATCAATACC CCGAGATAAA GCTTTTAAAG GAAAAAATCA TTACGGATGA AGATGGAATA
TACTCTAGCG CCGGTGGATT TTCTATGCTA AACCTCATAG TATATCTGAT TGAAAAATAC
GCCGGCAGGG AAGTTGCCTT GTATTGTGCT AAGTTTTTCC AGGTAGATAT CGACAGGAAT
GCACAGTTGC CATTTATAAT TTTTCAGGGA CAAAAAGAAC ACAACGATAT TCAGGTTAAA
CAGGCCCAAA CATACATCGA GGAACACTAC CAACAGCGCA TTACAATAGA TCAGTTAGCA
GAGATGCTGG CAATGGGTCG CCGTAGTCTA GAAAGAAGGT TTAAAAAGGC AACTTCCAAT
ACGCTGAACG AATACATACA ACGAGTAAAA ATTGAGGCAG CAAAAAAACA ATTGGAAAGT
GGTGAAAAGA AGATTAATGA CATCATGCTG GAAGTGGGGT ATTCCGATCA GAATAGTTTC
AGGCACTCCT TCAAATTATT TACCGGGTTA CTGCCGAATG AATACCGAAG CAAATACCAT
AGCAGAATTA CTAATTAA
 
Protein sequence
MKHISILIPT GGVLGGIEIA RNMIAEANNY VISKGNEAVF QMQLVGAKKN IELNERQVLI 
RPDVTINYLK QTDLIIIPAI NVDLELNNNP NVPLINWVRQ MRKQGAEVAC LCVGAFLLAK
TGILDGKTCT THWKASELFI NQYPEIKLLK EKIITDEDGI YSSAGGFSML NLIVYLIEKY
AGREVALYCA KFFQVDIDRN AQLPFIIFQG QKEHNDIQVK QAQTYIEEHY QQRITIDQLA
EMLAMGRRSL ERRFKKATSN TLNEYIQRVK IEAAKKQLES GEKKINDIML EVGYSDQNSF
RHSFKLFTGL LPNEYRSKYH SRITN