Gene Phep_0106 details

Gene Information

Locus tag: Phep_0106
Symbol:
ID: 8251191
Type: CDS
Is gene spliced: No
Is pseudo gene: No
Organism name: Pedobacter heparinus DSM 2366
Kingdom: Bacteria
Replicon accession: NC_013061
Strand:
Start bp: 114607
End bp: 115791
Gene Length: 1185 bp
Protein Length: 394 aa
Translation table: 11
GC content: 43%
IMG OID: 644933756
Product: helix-turn-helix- domain containing protein AraC type
Protein accession: YP_003090394
Protein GI: 255530022
COG category: [K] Transcription
COG ID: [COG2207] AraC-type DNA-binding domain-containing proteins
TIGRFAM ID:
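
The coordinate and length fields above are mutually consistent, which can be checked with a little arithmetic. The sketch below assumes the start and end positions are 1-based and inclusive, and that the 1185 bp gene length includes the terminal stop codon; the function names are hypothetical and are not part of the database.

```python
# Values copied from the Gene Information fields above.
START_BP = 114607
END_BP = 115791
GENE_LENGTH_BP = 1185


def gene_length(start_bp: int, end_bp: int) -> int:
    """Feature length, assuming 1-based inclusive coordinates."""
    return end_bp - start_bp + 1


def protein_length(cds_length_bp: int) -> int:
    """Residue count for an unspliced CDS whose last codon is a stop codon."""
    if cds_length_bp % 3 != 0:
        raise ValueError("CDS length is not a multiple of 3")
    return cds_length_bp // 3 - 1


print(gene_length(START_BP, END_BP))   # 1185
print(protein_length(GENE_LENGTH_BP))  # 394
```

Both results agree with the Gene Length (1185 bp) and Protein Length (394 aa) fields.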


Plasmid Coverage information

Num covering plasmid clones: 12
Plasmid unclonability p-value:
Plasmid hitchhiking: No
Plasmid clonability: normal
 

Fosmid Coverage information

Num covering fosmid clones: 31
Fosmid unclonability p-value:
Fosmid Hitchhiker: No
Fosmid clonability: normal
 

Sequence

Gene sequence
TTGCACCTGG TATTCATCAC AATTATTGGT TCGATTTTTA TTCTTTGCCT GGCCATTGTA 
AAGCTGTTGC TTTTCAGTAA GGGAAGGCAG TATTTAAACC TTTTGCTGTG CATAGCCATT
TTTGGGGTAA TTTGGTACGG CATTATTTAT TTGCTTACCA ACTCTGGGCT GATAAAAAAT
CACCCTGTAC TCTTTAATAA AGGCTTACCA CTTTATTATC TCATTGCACC ATGCTTTTTT
CTCTACATCA GGGGGTCCCT GCAGCCAGAT TACGCTGTTT TCAGGTTAAA GTACCTGTTG
CACCTAGTCG TTGTTATTCC GGCCCTGGTT GCTATTGTCC CATATAGTTT TGCCGATACT
GCTACACAAC AATGGGTGGT AAACCAGATA GACAAGGATG TCGCTTTTGC CTTTAGTAAC
AACAGGTATA TCGTGCCCAA CTGGCATTGG TTCACTTTTC CACTATCTGC ACTTTTATAT
ACACTGGCGC AATTCAGTTT TGCAATAGGG TATGCCAAAA CAAAAAAGTA TGAAAAAAAG
ACGCTCACCT GGGTATTGGC ATTTACGGTT ATTTGCGGAA TTATTTTCTT TGGAATGCTT
GCTGTAAATT TAAGTGTGCT GTCAAATTTC AATGATATCT GGCGTATTCT GCACTCCGGA
AGGCTGGTGT TATTCCTGGG TTTAGCAATG CTGGCGCTCA GTGGTTCTTT TTTTTTAAGC
CCCAGCCTTA TCTTTGGCTT TGTCCGCTTG AAACCAGCAA GCCAGGCCCG TACTGCTGAA
GCGCGTTCAG GCCGCCTGGA AGGGGAGCTC CATGAAAACC GGGTTAAGAT GTATGATACT
TCCCTTATTG TAAGGGTCGA AAAATATATA CTGCAGGCCG AGGTATTCAG AAAAACGGGT
CTTACGGTCA GCGATCTTGC CTCTATGCTG GAAATACCCA ATCATAAACT TTCCGATCTG
TTTAACAATC ATTATAAGCT TAACTTCAAT ACCTATATCA ATAACCTGCG TGTACATTAT
GTAAAAGGGC GGCTTGATGC CGGTGAATGG AAGCAGTTTA CACTGGAAGC TATTGCACAG
GAAGCCGGGT TTTCTTCCCG CAACACTTTT CTGATCGCAT TTAAAAGGAT AATGGGGGTT
ACGCCCTCCA ATTACCTGAC AAGCCTGAAA GACAAAGCTG CTTAA
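
A GC content figure like the one reported for this gene can be recomputed from the sequence text. The following is a minimal sketch, assuming GC content is simply the percentage of G and C bases over the full gene sequence; the database may round or count differently, and the helper name is hypothetical.

```python
def gc_content(dna: str) -> float:
    """Percentage of G and C bases, ignoring spaces and line breaks."""
    seq = "".join(dna.split()).upper()
    if not seq:
        raise ValueError("empty sequence")
    gc = sum(1 for base in seq if base in "GC")
    return 100.0 * gc / len(seq)


# Tiny illustrative input; paste the full gene sequence to check the record.
print(gc_content("ATGC"))  # 50.0
```

The function strips the 10-base spacing used in the listing, so the sequence block can be pasted in unchanged.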
 
Protein sequence
MHLVFITIIG SIFILCLAIV KLLLFSKGRQ YLNLLLCIAI FGVIWYGIIY LLTNSGLIKN 
HPVLFNKGLP LYYLIAPCFF LYIRGSLQPD YAVFRLKYLL HLVVVIPALV AIVPYSFADT
ATQQWVVNQI DKDVAFAFSN NRYIVPNWHW FTFPLSALLY TLAQFSFAIG YAKTKKYEKK
TLTWVLAFTV ICGIIFFGML AVNLSVLSNF NDIWRILHSG RLVLFLGLAM LALSGSFFLS
PSLIFGFVRL KPASQARTAE ARSGRLEGEL HENRVKMYDT SLIVRVEKYI LQAEVFRKTG
LTVSDLASML EIPNHKLSDL FNNHYKLNFN TYINNLRVHY VKGRLDAGEW KQFTLEAIAQ
EAGFSSRNTF LIAFKRIMGV TPSNYLTSLK DKAA
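
The protein sequence corresponds to the gene sequence translated with translation table 11, where the initial TTG codon is read as methionine. The sketch below illustrates this under stated assumptions: the amino acid assignments are those of the standard code (which table 11 shares, to my understanding), the start codon set is the one I understand NCBI to list for table 11, and `translate_cds` is a hypothetical helper rather than any database tool.

```python
BASES = "TCAG"
# Amino acids in NCBI order: first, second, third base each cycling over TCAG.
AMINO_ACIDS = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}
# Assumed start codons for table 11; an initiator codon is read as Met.
START_CODONS = {"ATG", "GTG", "TTG", "CTG", "ATT", "ATC", "ATA"}


def translate_cds(dna: str) -> str:
    """Translate a CDS from its first base, stopping at the first stop codon."""
    seq = "".join(dna.split()).upper()
    usable = len(seq) - len(seq) % 3
    protein = []
    for index in range(0, usable, 3):
        codon = seq[index:index + 3]
        residue = CODON_TABLE[codon]
        if residue == "*":
            break
        if index == 0 and codon in START_CODONS:
            residue = "M"
        protein.append(residue)
    return "".join(protein)


# First 30 bases of the gene sequence above.
print(translate_cds("TTGCACCTGG TATTCATCAC AATTATTGGT"))  # MHLVFITIIG
```

The output matches the first ten residues of the protein sequence. Passing the full gene sequence should reproduce the complete 394-residue protein if the assumptions above hold.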