Gene Phep_1038 details

Gene Information

Locus tag: Phep_1038
Symbol:
ID: 8252132
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 1214267
End bp: 1215451
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 38%
IMG OID: 644934691
Product: helix-turn-helix- domain containing protein AraC type
Protein accession: YP_003091320
Protein GI: 255530948
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
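
The coordinate and length fields above can be cross-checked against each other. The sketch below is only an illustration: it assumes the start and end positions are 1-based and inclusive, and that the last codon of the CDS is a stop codon that is not counted in the protein length. The function names are hypothetical and not part of this record.

```python
# Values copied from the Gene Information fields of this record.
start_bp = 1214267
end_bp = 1215451
gene_length_bp = 1185
protein_length_aa = 394


def span_length(start: int, end: int) -> int:
    """Length of a feature, assuming 1-based inclusive coordinates."""
    return end - start + 1


def expected_protein_length(cds_length_bp: int) -> int:
    """Residues encoded by a CDS, assuming the final codon is a stop."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(span_length(start_bp, end_bp))            # 1185
print(expected_protein_length(gene_length_bp))  # 394
```

Under those assumptions both values agree with the Gene Length and Protein Length fields.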


Plasmid Coverage information

Num covering plasmid clones: 11
Plasmid unclonability p-value: 0.895405
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 25
Fosmid unclonability p-value: 0.923131
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
ATGGGCTTTT TGAATTTTTT AACCAGCATT ATCTGCTTTT TTTTATTGCT GTTCTCCTTC 
AACCTGTTTT TCGCCAAACG GGGTAATCCT GTATTGAATT ATTTACTGGG AGTTATCTTT
TTTTCGAGAT TCGGTCAAAT GTTGGTCCTT TTATTGGTAA ACTCAAAACA GCAGACTTAT
TTTCCTTTTT TTCATCAGCT TTTTACCCCA TTGTACTTTG CCGCCCCGGC TTGCTTTTAC
CTTTATGTCA GCCATTTTAT TAACCCGGAT AAAAAATTAC CCTACAAGGA GTGGCTTCAT
TTTATCCCTG CTGCCCTGGC CATTATCCAT GTTATCCCAT GGCCTTTTGC TCCGGTTATC
AACTGGCACG ATATCTTAAA ACAGATCACT GATAATAAAC AGCTTTTTAT TAACGAAAGG
AGCGGGATAT TACCGGCCTA TTTATATTCT GTAGGCAGGC CTGCATTGGT ACTTGGTTAT
TTAGCCGCCA CCTGGTATAG GGTGTTAAAT TCGGAGGTAT TGAAAGCAAA ACCCAAAGCA
GATACTGGTA AGAAATGGAT CTTACTTTTT GTAAAGGCAG CAACGTTTTT CCAGCTCGTT
AGTTTTTTGC CATTGCTCAG TACAAGCCAG GACAGAACTT ATGCCAATTC CATATTTGTG
ATCATCAGCT GTCTCGTTTT GATTGTTATT GTTGTGTTCA TCCTACATCA GCCGGATATT
TTTTACCGTT ATTTAATAAC GCCTATTGAT GGTATAAAAG TCGCAGATAA AGGCCAGGAA
AGGGTAGAAG ATACTACATT GAATACCGGT AGTACTAAAA AAATCATCTT ATTACCTGAA
CAATCAGCTG CATACGCAGC TGAGATGGAA GCCTTAATGG CAGCTAAAAA GTTATACCTG
ATATCGGATT TTCAGATTGT TGACCTGGCT GCTGAAATGA ATATTTCTGT TCATCACTGC
TCATTTGTAA TCAACAACGT AATCGATAAA AACTTTCGTG ATTGGATAAA TGGTTACCGC
ATTAGCTATT TTAGCACACA ATATCCACTT CACGCGCACA AAATGACCAT TGAAGCCATT
GCTCATGAAT CTGGCTTTAA AAGCCTGGCA ACTTTTTACA ATGCCTTTAA AAAAGAGACA
GGTTTGATGC CCAAAGCCTA TTTCTCACAA AAGAAGGTAT CATAA
 
Protein sequence
MGFLNFLTSI ICFFLLLFSF NLFFAKRGNP VLNYLLGVIF FSRFGQMLVL LLVNSKQQTY 
FPFFHQLFTP LYFAAPACFY LYVSHFINPD KKLPYKEWLH FIPAALAIIH VIPWPFAPVI
NWHDILKQIT DNKQLFINER SGILPAYLYS VGRPALVLGY LAATWYRVLN SEVLKAKPKA
DTGKKWILLF VKAATFFQLV SFLPLLSTSQ DRTYANSIFV IISCLVLIVI VVFILHQPDI
FYRYLITPID GIKVADKGQE RVEDTTLNTG STKKIILLPE QSAAYAAEME ALMAAKKLYL
ISDFQIVDLA AEMNISVHHC SFVINNVIDK NFRDWINGYR ISYFSTQYPL HAHKMTIEAI
AHESGFKSLA TFYNAFKKET GLMPKAYFSQ KKVS
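
The gene and protein sequences above are related by codon translation, and the GC content field is a base-composition percentage. The following is a minimal sketch of both calculations, not the method the database itself uses. It assumes the gene sequence is shown in the coding direction and relies on the fact that translation table 11 assigns the same amino acids to codons as the standard genetic code. The names `clean`, `translate` and `gc_percent` are hypothetical helpers.

```python
from itertools import product

# Codon table in the conventional T, C, A, G ordering.
# The third base varies fastest, matching the order of AMINO_ACIDS.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def clean(seq: str) -> str:
    """Remove the spaces and line breaks used to group bases in blocks of ten."""
    return "".join(seq.split()).upper()


def translate(seq: str) -> str:
    """Translate a coding sequence, stopping at the first stop codon."""
    seq = clean(seq)
    residues = []
    for i in range(0, len(seq) - 2, 3):
        aa = CODON_TABLE[seq[i:i + 3]]
        if aa == "*":
            break
        residues.append(aa)
    return "".join(residues)


def gc_percent(seq: str) -> float:
    """Percentage of G and C bases in a non-empty sequence."""
    seq = clean(seq)
    return 100.0 * sum(base in "GC" for base in seq) / len(seq)


# First 30 bases of the gene sequence, as grouped in the record.
first_bases = "ATGGGCTTTT TGAATTTTTT AACCAGCATT"
print(translate(first_bases))  # MGFLNFLTSI
```

The first ten residues produced this way match the start of the protein sequence. Passing the full gene sequence to `gc_percent` gives a value that can be compared with the GC content field.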